Abstract



We introduce a multivariate Hawkes process with constraints on its conditional density. 
It is a multivariate point process with conditional intensity similar to that of a multivariate 
Hawkes process but certain events are forbidden with respect to boundary conditions on a 
multidimensional constraint variable, whose evolution is driven by the point process. We 
study this process in the special case where the fertility function is exponential so that the 
process is entirely described by an underlying Markov chain, which includes the constraint 
variable. Some conditions on the parameters are established to ensure the ergodicity of 
the chain. Moreover, scaling limits are derived for the integrated point process. This
study is primarily motivated by the stochastic modelling of a limit order book for high
frequency financial data analysis.

1 Introduction 

For two decades, the high-frequency price dynamics in a limit order book have received
much attention. On the one hand, extensive statistical studies of limit order book
dynamics and of limit order book information content can be found in [9, 26, 39, 19, 30,
10, 25, 14, 41]; on the other hand, many researchers have proposed different
models, including equilibrium models, agent-based models and Markov models, for the
limit order book dynamics, see [36, 38, 17, 15, 1, 16].

More recently, point processes have been introduced in financial applications. Early
works focus on modelling inter-event durations [23, 24] and the effect of duration
on price impact and trade sign autocorrelation [20]. Dufour and Engle [22] extend
these models to allow for the censoring of quotes after a trade by intervening trades.

Bauwens and Hautsch [8] provide a comprehensive introduction to the application of
point processes in financial time series. Hawkes processes belong to the class of self- 
exciting point processes, where the intensity is driven by a weighted function of the time 
distance to previous points of the process. Hawkes processes originate from the literature 
in seismology and are used to model the occurrence of earthquakes, see [28, 27, 35, 18, 13]. 
To the best of our knowledge, the first application of Hawkes processes to financial time
series was introduced in [11]. Since then, there has been a growing literature on the application
of Hawkes processes to the modelling of high frequency financial data, see [7, 29, 12,
31, 5, 2, 34, 21, 6]. Although nonparametric estimation of the intensity functions
has been investigated by Reynaud-Bouret and Schbath [37] and Al Dayri et al. [4] in
dimension d = 1, the majority of publications on Hawkes processes work in the
setting of exponential kernel functions for computational reasons. Embrechts et al. [21]
apply marked multivariate Hawkes processes to model daily financial data. Muni Toke
and Pomponio [34] introduce Hawkes processes in the modelling of trade-throughs.
Bacry et al. use marked Hawkes processes to reproduce empirically microstructure noise
and stylized facts such as the Epps effect and lead-lag in [5], and go on to derive and
characterize the exact macroscopic diffusion limit of this model in [6].

In this contribution we wish to model the dynamics of the limit prices in the limit order 
book. Nevertheless, we remark that the limits evolve according to specific constraints. 
For instance the best bid/best ask (buy/sell) price's behavior is constrained by a minimal 
bid-ask spread. This means that, as soon as the minimal bid-ask spread is attained, the 
best bid price cannot increase until the best ask price goes up. Therefore, in this work, 
we propose to add a constraint variable to the classical multivariate Hawkes process to 
model the dynamics of the limit prices. We restrict ourselves to the Markov setting (with
exponential fertility functions) and then investigate the ergodicity property and scaling
limits of the constrained multivariate Hawkes processes. In contrast, the Markov models
proposed in [17, 16, 1] use simple Poisson processes and do not take into account the
cross-interaction between different events. Apart from its application to a limit order
book, the constrained multivariate Hawkes process can be extended to many domains 
where the intensities of the process may depend on some constraint variables. 

The paper is structured as follows. In Section 2 we introduce the constrained mul- 
tivariate Hawkes processes and present the main assumptions and notation. Section 3 
gives the main results on the geometric ergodicity and the large scale behavior. The
application to a limit order book is presented in Section 4. We have gathered the detailed 
proofs in Section 5. Some useful technical results are postponed to Appendix A. 

2 Main assumptions and notation 

2.1 Multivariate Hawkes process with constraints 

Let p, q be two positive integers. We introduce a constrained multivariate Hawkes process
defined by a couple (S, N), where $\{S(t), t \ge 0\}$ is a constraint process taking its values in
$\mathbb{Z}_+^q = \{1, 2, \dots\}^q$ and N is a p-dimensional multivariate point process. We shall describe
the dynamics of this model by the conditional intensity of N as defined in [13] and an
evolution equation for S.

We assume that S is right-continuous with left limits and denote by $S(t-)$ its left limit
at time t. The conditional intensity of the multivariate point process N at time t shall
depend on the constraint variable $S(t-)$. For notational compactness we will see N as a
marked point process having arrivals in $\mathbb{R}$ with marks in $\{1, \dots, p\}$. We will denote by
$N_j$, $j = 1, \dots, p$, the p point processes corresponding to each mark j, namely

\[ N = \sum_{j=1}^{p} N_j \otimes \delta_j . \]

In other words, each thinned unmarked process $N_j$ is given from N by the formula

\[ N_j(g) = N(g \otimes \mathbb{1}_{\{j\}}) , \]

for all non-negative measurable functions g defined on $\mathbb{R}$. Here we use the notation $\mu(g)$
for the integral of g with respect to a measure $\mu$. The constraint variable $S(t-)$ acts on
the point process by constraining the set of possible marks for the first event arriving
after time t in N. The constraint is added in the usual expression of the conditional
intensity of a multivariate Hawkes process. More precisely, for all time $t \ge 0$ and any
mark $i = 1, \dots, p$, the conditional intensity of $N_i$ given $N|_{(-\infty,t)}$ and $S(t-)$ is set as

\[ \mu(t,i) = \begin{cases} 0 & \text{if } S(t-) \in A_i , \\ \mu_0(i) + \sum_{j=1}^{p} \int_{(-\infty,t)} \phi_{i,j}(t-u)\, N_j(du) & \text{otherwise,} \end{cases} \tag{2.1} \]

where $\mu_0(i)$ is the immigrant rate of mark i, $\phi_{i,j}$ is the fertility rate for producing a
mark i event from a mark j event, and $A_1, \dots, A_p$ are products of finite subsets of $\mathbb{Z}_+$,

\[ A_i = A_i(1) \times \dots \times A_i(q) . \]

In turn, an arrival in N increases the value of the constraint by a value only depending
on the mark. More precisely, the constraint process S satisfies the evolution equation, for
all $t \ge u \ge 0$,

\[ S(t) = S(u) + N(\mathbb{1}_{(u,t]} \otimes J) , \tag{2.2} \]

where J is defined on $\{1, \dots, p\}$ with values in $\mathbb{Z}^q$. In other words, at each arrival in N
with mark equal to i, S jumps by an increment given by J(i).

Note that equations (2.1) and (2.2) are valid for $t \ge 0$ and $t \ge u \ge 0$ respectively so
that the distribution of the process

\[ \{ (S(t), N_i((0,t])) ,\ t \ge 0 ,\ i = 1, \dots, p \} \]

is defined conditionally on the initial conditions given by $S(0-)$ and N restricted to
$(-\infty, 0)$. To summarize, this distribution is parameterized by the constraint sets $A_1, \dots, A_p$,
the immigrant rates $\mu_0(i)$, $i = 1, \dots, p$, the cross-fertility rate functions $\phi_{i,j} : [0, \infty) \to [0, \infty)$,
$i,j = 1, \dots, p$, and the jump increments $J(i) \in \mathbb{Z}^q$ for $i \in \{1, \dots, p\}$.

Meanwhile, we define an unconstrained multivariate Hawkes process N' with the same
fertility rate and immigrant intensity (but without constraints), that is, with conditional
intensity on $[0, \infty)$ satisfying, for $i = 1, \dots, p$,

\[ \mu'(t,i) = \mu_0(i) + \sum_{j=1}^{p} \int_{(-\infty,t)} \phi_{i,j}(t-u)\, N'_j(du) , \quad t \ge 0 , \]

where $N'_j = N'(\cdot \otimes \mathbb{1}_{\{j\}})$ for $j = 1, \dots, p$.

The average fertility matrix is defined by

\[ \aleph = [\alpha_{i,j}]_{i,j=1,\dots,p} \quad \text{with} \quad \alpha_{i,j} = \int_0^\infty \phi_{i,j} , \quad 1 \le i,j \le p . \tag{2.3} \]

The following assumption ensures the existence of a stationary version of the (unconstrained)
multivariate Hawkes process N', see [18, Example 8.3(c)].

Assumption 1. The spectral radius of $\aleph$ is strictly less than 1.

Under Assumption 1, we will use the following (well defined) vector

\[ u = (\mathrm{Id}_p - \aleph^T)^{-1} \mathbb{1}_p = \sum_{k \ge 0} (\aleph^T)^k \mathbb{1}_p , \tag{2.4} \]

where $\mathbb{1}_p$ denotes the p-dimensional vector of ones and $\mathrm{Id}_p$ denotes the p × p identity matrix.
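
As an illustration of (2.4), the vector u can be approximated by truncating the series; for a matrix with nonnegative entries the partial sums converge exactly when Assumption 1 holds. The following Python sketch is not part of the original development: the function name `neumann_u`, the tolerance and the iteration cap are assumptions made for this example, and the fertility matrix is passed as a plain list of lists `alpha[i][j]`.

```python
def neumann_u(alpha, tol=1e-12, max_iter=10_000):
    """Approximate u = sum_{k>=0} (alpha^T)^k 1_p by truncating the series.

    `alpha` is a p x p list of lists with nonnegative entries alpha[i][j].
    Returns None when the terms have not become negligible after `max_iter`
    iterations, which suggests that the spectral radius is not below 1.
    """
    p = len(alpha)
    term = [1.0] * p          # (alpha^T)^0 applied to the vector of ones
    u = [1.0] * p             # running partial sum of the series
    for _ in range(max_iter):
        # Next term: alpha^T applied to the previous term.
        term = [sum(alpha[j][i] * term[j] for j in range(p)) for i in range(p)]
        u = [a + b for a, b in zip(u, term)]
        if max(term) < tol:   # heuristic stopping rule
            return u
    return None
```

The stopping rule is only a numerical heuristic; an exact check of Assumption 1 would require computing the eigenvalues of the matrix.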

In the following, by convention, we use boldface symbols only when q may be larger
than one. In the case where q is set to 1, we shall write J and $S_n$ in non-bold face since
they are scalar valued and, similarly, we write $A_i$ in non-bold face for the constraint sets.

For a real valued function w defined on $\{1, \dots, p\}$ we shall use the notation

\[ \underline{w} = [w(1), \dots, w(p)]^T . \]

If w is vector-valued, we use the same notation to obtain a matrix. For instance, $\underline{J}$ is a
p × q matrix.

2.2 A very special case 

When we study the ergodicity of the process, we investigate the joint ergodicity of the 
constraint variable and of the point process. A very special case is obtained by setting 

\[ \phi_{i,j} = 0 , \quad i,j = 1, \dots, p , \]

and

\[ q = 1 . \]

In this case, we only need to study the stationarity of the constraint variable since the
intensities are constant. Suppose moreover that J only takes values +1 and −1 and
the constraint sets $A_i$ are all equal to {1}. Let $I_+, I_-$ be the partition of $\{1, \dots, p\}$
corresponding to the indices where J takes values +1 and −1, respectively. In this case,
$\{S(t), t \ge 0\}$ is a birth-death process on $\mathbb{Z}_+$ with constant birth rate $\mu_+ = \sum_{i \in I_+} \mu_0(i)$ and
constant death rate $\mu_- = \sum_{i \in I_-} \mu_0(i)$. By [3, Corollary 2.5], $(S(t))_{t \ge 0}$ is ergodic if and only
if

\[ \mu_+ - \mu_- = \underline{J}^T \underline{\mu_0} = \sum_{i=1}^{p} J(i)\, \mu_0(i) < 0 . \tag{2.5} \]

Here and in the following, A T denotes the transpose of matrix A. 

2.3 The Markov assumption 

We shall consider a particular shape of the fertility functions $\phi_{i,j}$ which implies a Markov
property for the model. This allows us to obtain very precise results on the ergodicity
of (S, N) and is also of interest in applications since the model is parameterized by a
restricted set of well understood parameters.

From now on, we suppose that there exists $\beta > 0$ such that

\[ \phi_{i,j}(u) = \alpha_{i,j}\, \beta e^{-\beta u} , \quad 1 \le i,j \le p . \]

This parameter is the reciprocal of a time: the larger $\beta$ is, the faster the influence of
a past event on the intensities decays. We denote a new process defined in
the state space $\mathsf{X} = \mathbb{Z}_+^q \times \mathbb{R}_+^p$ by $X(t) = (S(t), \lambda(t))$ where $\lambda(t) = [\lambda(t,1), \dots, \lambda(t,p)]^T$
with, for all $i \in \{1, \dots, p\}$,

\[ \lambda(t,i) = \sum_{j=1}^{p} \int_{(-\infty,t]} \phi_{i,j}(t-u)\, N_j(du) . \]

Proposition 2.1. The process {X(t), t > 0} is a Markov process. 
This property directly follows from the exponential form of the fertility functions $\phi_{i,j}$
(see [32] in the unconstrained case). In the following, we shall describe the dynamics of
the process $\{X(t), t \ge 0\}$ through a discretely sampled version based on the description
of the process N using its hazard rate. This description implies Proposition 2.1. More
importantly, in the following, we shall exclusively rely on this discrete description to study
the ergodicity of $\{X(t), t \ge 0\}$.

Let $\mu_0^0$ denote an arbitrary positive constant (say $\mu_0^0 = 1$). The continuous time
process $\{X(t), t \ge 0\}$ is sampled at increasing discrete times $(T_n)_{n \ge 1}$ defined by the jump
instants of $\lambda(t)$ and by the arrivals of an independent homogeneous Poisson point process (PPP) $N_0$ with intensity
$\mu_0^0$. Observe that S(t) is constant between two consecutive jump instants. Hence, almost
surely, S does not change at arrivals corresponding to the PPP $N_0$. The sample time
instants generated by $N_0$ are artificially added to avoid the periodicity of the embedded
chain, see the end of the proof of Proposition 5.3.

Each jump of X(t) corresponds to an arrival in the point process N. We denote its
mark by $I_n$, which takes values in $\{1, \dots, p\}$, where n is the positive integer such that the
corresponding jump instant t equals $T_n$. For other sample times $T_n$, which thus correspond
to the arrivals of $N_0$, we set $I_n = 0$. Hence the marks of the embedded chain will take
values in the set $\{0, 1, \dots, p\}$.

Finally, we define $\Delta_1 = T_1$ and, for each $n \ge 2$, $\Delta_n = T_n - T_{n-1}$, $S_n = S(T_n)$,
$\lambda_n = \lambda(T_n)$ and $X_n = X(T_n) = (S_n, \lambda_n)$. Then, given $(S_0, \lambda_0)$ and $T_0 = 0$, we can
generate the sequence $(\Delta_n, S_n, \lambda_n)$, $n = 0, 1, \dots$ iteratively as follows.



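\[ \Delta_{n+1} = \min(\Delta^0_{n+1}, \Delta^1_{n+1}, \dots, \Delta^p_{n+1}) , \]

where, given $X_0, \dots, X_n$, $\Delta_1, \dots, \Delta_n$ and $I_1, \dots, I_n$, the conditional distribution of $\Delta^0_{n+1}$,
$\Delta^1_{n+1}$, ... and $\Delta^p_{n+1}$ is that of positive independent variables whose marginal distributions
are determined by hazard rates $\mathrm{Hr}_0, \mathrm{Hr}_1, \dots, \mathrm{Hr}_p$ defined for each i and $t \ge 0$ by

\[ \mathrm{Hr}_i(t) = \begin{cases} (\mu_0(i) + \lambda_n(i) e^{-\beta t})\, \mathbb{1}_{A_i^c}(S_n) , & i > 0 , \\ \mu_0^0 , & i = 0 . \end{cases} \tag{2.6} \]

Consequently, the hazard rate of $\Delta_{n+1}$ is the sum

\[ \mathrm{Hr}(t) = \mu_0^0 + \sum_{j=1}^{p} \mathbb{1}_{A_j^c}(S_n) [ \mu_0(j) + \lambda_n(j) e^{-\beta t} ] . \]

Then, the event type at time $T_{n+1}$ is

\[ I_{n+1} = \arg\min_{i \in \{0, \dots, p\}} (\Delta^i_{n+1}) , \]

the constraint variable at time $T_{n+1}$ is given by

\[ S_{n+1} = S(T_{n+1}) = S_n + J_0(I_{n+1}) , \tag{2.7} \]

where $J_0$ is the extension of J to $\{0, 1, \dots, p\}$ defined by

\[ J_0(i) = \begin{cases} J(i) & \text{if } i \ge 1 , \\ 0 & \text{if } i = 0 , \end{cases} \tag{2.8} \]

and, for all $i \in \{1, \dots, p\}$, the self-excitation intensities at time $T_{n+1}$ are given by

\[ \lambda_{n+1}(i) = \lambda(T_{n+1}, i) = \lambda_n(i) e^{-\beta \Delta_{n+1}} + \beta \alpha_{i, I_{n+1}} , \tag{2.9} \]

with the convention $\alpha_{i,0} = 0$. Observe that, for all $T_n \le t < T_{n+1}$, the continuous time process is interpolated as

\[ S(t) = S_n , \qquad \lambda(t,i) = \lambda_n(i) e^{-\beta (t - T_n)} . \]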
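
The iterative description (2.6)-(2.9) translates directly into a simulation of the embedded chain. The sketch below, written for q = 1, is illustrative only: the names `sample_delay` and `step`, the argument layout and the use of Python's `random.Random` are assumptions of this example, and the auxiliary mark 0 is assumed neither to move S nor to excite the intensities. A waiting time with hazard rate $\mu + \lambda e^{-\beta t}$ is drawn as the minimum of an exponential clock and a clock obtained by inverting the cumulative hazard $(\lambda/\beta)(1 - e^{-\beta t})$.

```python
import math
import random


def sample_delay(mu, lam, beta, rng):
    """Draw a waiting time with hazard rate mu + lam * exp(-beta * t)."""
    # Constant part of the hazard: an exponential clock (never rings if mu == 0).
    t_const = rng.expovariate(mu) if mu > 0 else math.inf
    # Decaying part: invert the cumulative hazard (lam / beta) * (1 - exp(-beta * t)).
    # The clock never rings when the unit exponential exceeds lam / beta.
    ratio = rng.expovariate(1.0) * beta / lam if lam > 0 else math.inf
    t_decay = -math.log(1.0 - ratio) / beta if ratio < 1.0 else math.inf
    return min(t_const, t_decay)


def step(s, lam, mu0, alpha, beta, J, A, mu00, rng):
    """One transition of the embedded chain: returns (Delta, I, S, lambda).

    s     : current constraint value S_n (scalar, q = 1)
    lam   : list of current intensities lambda_n(i), i = 1..p (0-based list)
    mu0   : immigrant rates, alpha : fertility matrix, beta : decay rate
    J     : jump increments J(i), A : list of forbidden sets A_i
    mu00  : rate of the auxiliary Poisson clock (mark 0)
    """
    p = len(mu0)
    delays = [rng.expovariate(mu00)]            # mark 0: auxiliary Poisson clock
    for i in range(p):
        if s in A[i]:                           # mark i + 1 is forbidden in state s
            delays.append(math.inf)
        else:
            delays.append(sample_delay(mu0[i], lam[i], beta, rng))
    mark = min(range(p + 1), key=delays.__getitem__)
    delta = delays[mark]
    decay = math.exp(-beta * delta)
    if mark == 0:                               # S unchanged, no excitation
        return delta, 0, s, [l * decay for l in lam]
    j = mark - 1
    new_lam = [lam[i] * decay + beta * alpha[i][j] for i in range(p)]
    return delta, mark, s + J[j], new_lam
```

Calling `step` repeatedly from an initial state $(S_0, \lambda_0)$ yields a sample path of $(\Delta_n, I_n, S_n, \lambda_n)$; summing the returned delays gives the sampling times $T_n$.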
For any probability distribution $\nu$ on $\mathsf{X} = \mathbb{Z}_+^q \times \mathbb{R}_+^p$, we denote by $P^\nu$ the probability
corresponding to the initial distribution $\nu$ at time t = 0. The notation $E^\nu$ denotes the
corresponding expectation. If $\nu$ is a Dirac distribution at point $x \in \mathsf{X}$, we will simply
write $P^x$ and $E^x$.

The above facts thus imply that $X_n = (S_n, \lambda_n)$, $Y_n = (I_n, S_n, \lambda_n)$ and
$Z_n = (\Delta_n, I_n, S_n, \lambda_n)$, defined for n = 0, 1, ..., are Markov chains respectively valued in
$\mathsf{X} = \mathbb{Z}_+^q \times \mathbb{R}_+^p$, $\mathsf{Y} = \{0, 1, \dots, p\} \times \mathbb{Z}_+^q \times \mathbb{R}_+^p$ and $\mathsf{Z} = \mathbb{R}_+ \times \{0, \dots, p\} \times \mathbb{Z}_+^q \times \mathbb{R}_+^p$. We shall
denote by Q, Q and Q the transition kernels of these Markov processes, respectively.

Remark 1. Observe that, from the description above, the conditional distribution of
$(\Delta_1, I_1, S_1, \lambda_1)$ given $(\Delta_0, I_0, S_0, \lambda_0)$ does not depend on $(\Delta_0, I_0)$. In particular, the whole
path $\{Z_n, n \ge 1\}$ only depends on the initial condition set on $X_0$ and we may write, for
all $z = (\delta, i, x) \in \mathsf{Z}$ and all $A \in \mathcal{B}(\mathsf{Z})$,

\[ Q(z, A) = P^x( (\Delta_1, I_1, X_1) \in A ) . \tag{2.10} \]

Clearly the same remark holds for Q, namely, $Q(y, A) = P^x((I_1, X_1) \in A)$ for all
$y = (i, x) \in \mathsf{Y}$ and $A \in \mathcal{B}(\mathsf{Y})$.

Because the ergodicity of Q will be proved using an induction on the number of
constraints q, we need to introduce further Markov chains and transition kernels. To
initiate the induction, we define the Markov chain $\{(I_n, \lambda_n), n \ge 0\}$ as the chain valued
in $\{0, 1, \dots, p\} \times \mathbb{R}_+^p$ that starts at the same state as $(I_0, \lambda_0)$ but with transition kernel
Q, which is defined as Q but with all the $A_i$'s replaced by the empty set and without
the constraint variable. We will call this chain the unconstrained Hawkes embedded chain,
since it is associated to a classical (unconstrained) multivariate Hawkes point process.
This chain corresponds to the case q = 0.

Remark 2. As in the case $q \ge 1$, the conditional distribution of $\{(I_n, \lambda_n), n \ge 1\}$
given the initial condition $(I_0, \lambda_0)$ does not depend on $I_0$. Hence we will use the notation
$P^\ell$ or $E^\ell$ to underline this fact, for instance, for any $y = (i, \ell) \in \{0, 1, \dots, p\} \times \mathbb{R}_+^p$,
$j \in \{0, 1, \dots, p\}$ and Borel subset $A \subset \mathbb{R}_+^p$,

\[ Q(y, \{j\} \times A) = P^\ell(I_1 = j, \lambda_1 \in A) . \]

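Furthermore, for any $\mathcal{J} \subset \{1, \dots, q\}$, we denote by $Q^{(-\mathcal{J})}$ the transition kernel defined
on $\{0, 1, \dots, p\} \times \mathbb{Z}_+^{q - \#\mathcal{J}} \times \mathbb{R}_+^p$ as the transition kernel Q but without the constraint
variables $S_j$, $j \in \mathcal{J}$, and their corresponding constraint sets $A_1(j), \dots, A_p(j)$. In particular,
$Q^{(-\emptyset)}$ is the constrained kernel Q and $Q^{(-\{1, \dots, q\})}$ is the unconstrained kernel Q.
Meanwhile, for any $\mathcal{J} \subset \{1, \dots, q\}$, we denote by
$Q^{(+\mathcal{J})}$ the transition kernel defined on $\{0, 1, \dots, p\} \times \mathbb{Z}_+^{\#\mathcal{J}} \times \mathbb{R}_+^p$ as the unconstrained
kernel Q with the constraint variables $S_j$, $j \in \mathcal{J}$, and their corresponding constraint sets
$A_1(j), \dots, A_p(j)$. In particular $Q^{(+\{1, \dots, q\})}$ is the constrained kernel Q and $Q^{(+\emptyset)}$ is the unconstrained kernel Q.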

3 Main results 

We now present the main results of this work. All the results will rely on the Markov 
assumption introduced in Section 2.3. Under this Markov framework, in Section 3.1, we 
show that the kernels Q and Q defined in Section 2.3 are $\psi$-irreducible and aperiodic. We
provide a partial drift condition in Section 3.2 and then prove that Q is V-geometrically
ergodic in Section 3.3, where V is unbounded off petite sets. Then we study the ergodicity
in the case q = 1 in Section 3.4. The general case is presented in Section 3.5. Finally, we 
determine the scaling limit of the point process in physical time in Section 3.6. 
3.1 Irreducibility

Because of the constraint sets $A_i$ and the function J, the path of the process S cannot
evolve arbitrarily. The following definitions will be useful.

Definition 1. Let m be a positive integer and $s \in \mathbb{Z}_+^q$. The set of admissible paths $\mathcal{A}_m(s)$
is defined as the set of $(j_1, \dots, j_m) \in \{0, \dots, p\}^m$ such that

\[ s \in A_{j_1}^c , \quad s + J_0(j_1) \in A_{j_2}^c , \quad \dots , \quad s + \sum_{n=1}^{m-1} J_0(j_n) \in A_{j_m}^c . \tag{3.1} \]

An admissible path $(j_1, \dots, j_m) \in \mathcal{A}_m(s)$ implies that, for any $i = 0, 1, \dots, p$ and
$\ell \in (0, \infty)^p$, given $Y_0 = (i, s, \ell)$ the conditional probability to have $I_1 = j_1, \dots, I_m = j_m$
(and thus, by (2.7), $S_k = s + \sum_{n=1}^{k} J_0(j_n)$ for all $k = 1, 2, \dots, m$) is positive.

The main assumption of this section is the following one. 

Assumption 2. The fertility matrix $\aleph$ defined in (2.3) is invertible. Moreover, if $q \ge 1$,
there exists $s_0 \in \mathbb{Z}_+^q$ such that the following assertion holds. For all integers $K \ge 1$, there
exists an integer $m \ge p + 1$ such that for all $s \in \{1, \dots, K\}^q$, there exists an admissible
path $(j_1, \dots, j_m) \in \mathcal{A}_m(s)$ such that $s + \sum_{n=1}^{m} J_0(j_n) = s_0$ and
$\{j_{m-p+1}, \dots, j_m\} = \{1, \dots, p\}$ (possibly in a different order).

We refer to [33] for the definition of small sets and $\psi$-irreducibility. The assumption
on the invertibility of $\aleph$ will be useful for proving the $\psi$-irreducibility of the embedded
Markov chain. The second assumption says that there exists an admissible path (see
Definition 1) with the last p steps containing the p different marks. We will check this
condition for the application of the constrained Hawkes process to a limit order book in
Section 4. It will be used to establish the existence of small sets for Q, see Proposition 5.3.

We can now state the main result of this section. 

Theorem 3.1. Let Q and Q be the transition kernels defined in Section 2.3 on the spaces
$\mathbb{Z}_+^q \times \mathbb{R}_+^p$ and $\{0, 1, \dots, p\} \times \mathbb{Z}_+^q \times \mathbb{R}_+^p$ respectively, with $p \ge 1$ and $q \ge 0$. Suppose
moreover that Assumption 2 holds. Then the kernels Q and Q are aperiodic and
$\psi$-irreducible. Moreover for all $K \ge 1$ and $M > 0$, $\{1, \dots, K\}^q \times (0, M]^p$ and
$\{0, \dots, p\} \times \{1, \dots, K\}^q \times (0, M]^p$ are petite sets for Q and Q, respectively.

The proof is postponed to Section 5.1. 
3.2 Partial drift condition 

We now derive a partial drift condition that will be useful to obtain a "complete" drift 
condition on the process {X(t), t > 0}. In the following result, the drift condition is 
partial in the sense that it does not control $S_n$. It only says that, under Assumption 1,
independently of the process $S_n$, $\lambda_n$ should not have large excursions away from a compact
set.

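Proposition 3.2. Let Q be the kernel defined in Section 2.3 on the space $\mathbb{Z}_+^q \times \mathbb{R}_+^p$ with
$p \ge 1$ and $q \ge 0$. Suppose that Assumption 1 holds. Then there exist $\gamma > 0$, $\theta \in (0, 1)$,
$M > 0$ and $b > 0$ such that, for all $s \in \mathbb{Z}_+^q$ and $\ell \in \mathbb{R}_+^p$, we have

\[ [Q(\mathbb{1}_{\mathbb{Z}_+^q} \otimes V_{1,\gamma})](s, \ell) \le \theta\, V_{1,\gamma}(\ell) + b\, \mathbb{1}_{(0,M]^p}(\ell) , \tag{3.2} \]

where

\[ V_{1,\gamma}(\ell) = e^{\gamma u^T \ell} , \tag{3.3} \]

with u defined as in (2.4).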
The proof is postponed to Section 5.2.

3.3 The case q = 0

If q = 0, the process N is a standard (unconstrained) multivariate Hawkes process. In
this case the process $\{\lambda_n, n \ge 1\}$ is a Markov chain (the constraint variable $S_n$ vanishes)
and the "partial" drift condition of Proposition 3.2 becomes a "complete" drift condition
for this chain. Moreover, if q = 0 in Theorem 3.1, Assumption 2 boils down to assuming
that the fertility matrix $\aleph$ defined in (2.3) is invertible. We also note that the sublevel
sets of $V_{1,\gamma}$ are small sets by Theorem 3.1. Hence we obtain that $\{\lambda_n, n \ge 0\}$ is
$V_{1,\gamma}$-geometrically ergodic, see [33, Chapter 15] (and also [33, Theorem 9.1.8] to get the Harris
recurrence). In fact the same result holds on the extended chain $\{(I_n, \lambda_n), n \ge 0\}$, whose
transition kernel has been denoted by Q in Section 2.3.

Proposition 3.3. Let Q be the transition kernel defined in Section 2.3 on the space
$\{0, 1, \dots, p\} \times \mathbb{R}_+^p$. Suppose that $\aleph$ is invertible and that Assumption 1 holds. Then there
exists $\gamma > 0$ such that Q is $(\mathbb{1}_{\{0, \dots, p\}} \otimes V_{1,\gamma})$-geometrically ergodic, where $V_{1,\gamma}$ is defined
in (3.3).

Proof. As explained above, Theorem 3.1 and Proposition 3.2 apply to the case q = 0 and,
as a consequence, $\{\lambda_n, n \ge 0\}$ is $V_{1,\gamma}$-geometrically ergodic. This conclusion also applies
to the kernel Q of the chain $\{(I_n, \lambda_n), n \ge 0\}$ with $V_{1,\gamma}$ replaced by $\mathbb{1}_{\{0, \dots, p\}} \otimes V_{1,\gamma}$
because the conditional distribution of $(I_1, \lambda_1)$ given $(I_0, \lambda_0)$ does not depend on $I_0$. □

Having proved the ergodicity of Q, we can establish the following result on its stationary
distribution. It will be used to obtain the case q = 1.

Corollary 3.4. Under the same assumptions as Proposition 3.3, denote by $\pi$ the stationary
distribution of Q. Let $w : \{1, \dots, p\} \to \mathbb{R}$. Let $w_0$ be the extension of w to
$\{0, 1, \dots, p\}$ obtained by setting $w_0(0) = 0$. Then we have

v^ T (Id„ - N)"Vn 

The proof is postponed to Section 5.3. 

3.4 The case q = 1 

We shall further denote

\[ s^* = 1 + \max \Big( \bigcup_{i=1}^{p} A_i \Big) . \tag{3.5} \]

In the case q = 1, we need a drift function that applies to $X_n = (S_n, \lambda_n)$ which is
unbounded off petite sets, that is, the drift function must diverge as at least one component
of $X_n$ goes to infinity, while, in Proposition 3.2 the drift function goes to infinity only
when one of the components of $\lambda_n$ goes to infinity. To this end we define

\[ V_{0,\gamma}(s) = e^{\gamma s} . \tag{3.6} \]

We introduce the following assumption.

Assumption 3. The following inequality holds

\[ \underline{J}^T (\mathrm{Id}_p - \aleph)^{-1} \underline{\mu_0} < 0 . \tag{3.7} \]

Note that Condition (2.5) corresponds to Assumption 3 in the special case $\aleph = 0$.
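
Condition (3.7) is easy to evaluate numerically for given parameters. The sketch below is an illustration under assumptions of our own (the function name `stationary_drift`, plain Python lists for J, the fertility matrix and the immigrant rates, and a fixed-point iteration in place of a matrix inversion): it computes $m = (\mathrm{Id}_p - \aleph)^{-1} \underline{\mu_0}$ as the limit of $m \leftarrow \underline{\mu_0} + \aleph m$, which converges under Assumption 1, and returns $\underline{J}^T m$ for q = 1.

```python
def stationary_drift(J, alpha, mu0, tol=1e-12, max_iter=10_000):
    """Return J^T (Id - alpha)^{-1} mu0, computed via the iteration m = mu0 + alpha m.

    The iteration converges when the spectral radius of the nonnegative
    matrix `alpha` is below 1; otherwise a ValueError is raised after
    `max_iter` iterations.
    """
    p = len(mu0)
    m = list(mu0)
    for _ in range(max_iter):
        new_m = [mu0[i] + sum(alpha[i][j] * m[j] for j in range(p))
                 for i in range(p)]
        if max(abs(a - b) for a, b in zip(new_m, m)) < tol:
            return sum(J[i] * new_m[i] for i in range(p))
        m = new_m
    raise ValueError("iteration did not converge; is the spectral radius < 1?")
```

A negative return value corresponds to (3.7); with a zero fertility matrix the function reduces to the sum in (2.5).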
Theorem 3.5. Let q = 1 and $p \ge 1$ and suppose that Assumptions 1, 2 and 3 hold. Let
Q be the kernel defined in Section 2.3 on the space $\{0, 1, \dots, p\} \times \mathbb{Z}_+ \times \mathbb{R}_+^p$.
Then, for all $\gamma_1 > 0$ small enough, there exists $\gamma_0^* > 0$ such that, for all $\gamma_0 \in (0, \gamma_0^*]$, Q is
$(\mathbb{1}_{\{0, \dots, p\}} \otimes V_{0,\gamma_0} \otimes V_{1,\gamma_1})$-geometrically ergodic, where $V_{0,\gamma_0}$ and $V_{1,\gamma_1}$ are defined in (3.6)
and (3.3).

The proof of Theorem 3.5 is omitted as it is a particular case of Theorem 3.7, which
is proved in Section 5.5.

Assumptions 1 and 2 appear to be very mild. Assumption 1 is related to the stability
of the underlying unconstrained Hawkes process. Assumption 2 is used to obtain small
sets for the chains Q and Q. A natural question is whether Assumption 3,
$\underline{J}^T (\mathrm{Id}_p - \aleph)^{-1} \underline{\mu_0} < 0$, is sharp. The following theorem partially answers this question.

Theorem 3.6. Let $\{(S_n, \lambda_n), n \ge 0\}$ be the Markov chain on the space $\mathbb{Z}_+ \times \mathbb{R}_+^p$ with
transition kernel Q defined in Section 2.3. Suppose that Assumptions 1 and 2 hold. Then
Q is $\psi$-irreducible. Moreover the two following assertions hold.

(i) If $\psi(\{s^*, s^* + 1, \dots\} \times (0, \infty)^p) = 0$, then Q is $(\mathbb{1}_{\{1, \dots, s^* - 1\}} \otimes V_{1,\gamma})$-geometrically
ergodic, where $s^*$ is defined by (3.5).

(ii) Otherwise, if

\[ \underline{J}^T (\mathrm{Id}_p - \aleph)^{-1} \underline{\mu_0} > 0 , \tag{3.8} \]

then Q is transient.

The proof is postponed to Section 5.4. 

Remark 3. The fact that Q is $\psi$-irreducible follows from Theorem 3.1. Then, by definition
of $\nu$ in the proof of Proposition 5.3, Assumption 2 implies that $\psi(\{s_0\} \times (0, \infty)^p) > 0$.
Thus if $s_0$ in Assumption 2 satisfies $s_0 \ge s^*$, then the case (i) does not happen.

3.5 General case 

To obtain a drift function in the general case $q \ge 1$, we shall use the function $V_{0,\gamma}$ defined
by (3.6) applied multiplicatively to each component of $S_n$ and the function $V_{1,\gamma_1}$ defined
by (3.3) applied to the intensity $\lambda_n$.

We shall see that the proof of the ergodicity for $q \ge 1$ relies on an induction on q, see
the details in Section 5.5. For q = 1 the induction applies under the simple Assumption 3
because, using Corollary 3.4, it allows us to compute some moment under the stationary
distribution of the unconstrained chain (with kernel Q). In general the induction is more
involved. We shall need some additional notation. For any set $\mathcal{J} \subset \{1, \dots, q\}$, we define
$J^{\mathcal{J}} : \{1, \dots, p\} \to \mathbb{Z}$ by

\[ J^{\mathcal{J}}(i) = \sum_{j \in \mathcal{J}} J_j(i) , \tag{3.9} \]

where $J_j(i)$ denotes the j-th component of J(i). We further denote by $J^{\mathcal{J}}_0$ the extension of $J^{\mathcal{J}}$ to $\{0, \dots, p\}$ defined by $J^{\mathcal{J}}_0(0) = 0$.

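Theorem 3.7. Let $q \ge 1$ and $p \ge 1$, and suppose that Assumptions 1 and 2 hold. Let
Q be the kernel defined in Section 2.3 on the space $\{0, 1, \dots, p\} \times \mathbb{Z}_+^q \times \mathbb{R}_+^p$. Define, for
all $\mathcal{J} \subset \{1, \dots, q\}$, the kernel $Q^{(+\mathcal{J})}$ as in Section 2.3. Check the following conditions in
this order:

for k = 1, 2, 3, ..., q do
    for $\mathcal{J} \subset \{1, \dots, q\}$ such that $\#\mathcal{J} = k$ do
        Check that

            \[ \underline{J^{\mathcal{J}}}^T (\mathrm{Id}_p - \aleph)^{-1} \underline{\mu_0} < 0 . \tag{3.10} \]

        for $\mathcal{J}' \subset \mathcal{J}$ with $\mathcal{J}' \notin \{\emptyset, \mathcal{J}\}$ do
            Check that

                \[ \pi^{(+\mathcal{J}')} \big( J_0^{\mathcal{J} \setminus \mathcal{J}'} \otimes \mathbb{1}_{\mathbb{Z}_+^{\#\mathcal{J}'}} \otimes \mathbb{1}_{\mathbb{R}_+^p} \big) < 0 . \tag{3.11} \]

        end
        Then, for all $\gamma_1 > 0$ small enough, there exists $\gamma_0^* > 0$ such that for all $\gamma_0 \in (0, \gamma_0^*]$,
        $Q^{(+\mathcal{J})}$ is $(\mathbb{1}_{\{0, \dots, p\}} \otimes V_{0,\gamma_0}^{\otimes k} \otimes V_{1,\gamma_1})$-geometrically ergodic. We denote by $\pi^{(+\mathcal{J})}$ its stationary
        distribution.
    end
end

Then, for all $\gamma_1 > 0$ small enough, there exists $\gamma_0^* > 0$ such that for all $\gamma_0 \in (0, \gamma_0^*]$, Q
is $(\mathbb{1}_{\{0, \dots, p\}} \otimes V_{0,\gamma_0}^{\otimes q} \otimes V_{1,\gamma_1})$-geometrically ergodic.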

The proof is postponed to the end of Section 5.5. 

Remark 4. By Corollary 3.4, Condition (3.10) is equivalent to (3.11) with $\mathcal{J}' = \emptyset$.

Remark 5. Observe that Condition (3.11) makes sense at this step. Indeed, since $\#\mathcal{J} = k$
and $\#\mathcal{J}' < k$, we have already checked that $Q^{(+\mathcal{J}')}$ is geometrically ergodic. Thus the
stationary distribution $\pi^{(+\mathcal{J}')}$ is well defined. In practice, one may check this assumption
using Monte Carlo simulations, since the law of large numbers holds in this case (see [33,
Theorem 17.1.7]).



3.6 Scaling limit 

In the application of this model to the limit order book, an increment of the best bid price 
or the best ask price at time $T_n$ is specified by the mark $I_n \in \{0, \dots, p\}$, see the details in
Section 4. It is interesting to investigate the microscopic behavior of the mid-price (mean 
of the best bid price and the best ask price) at large scales, see [5] for the non-constrained 
case. 

Suppose that the assumptions of Theorem 3.7 hold. Then Q is $(\mathbb{1}_{\{0, \dots, p\}} \otimes V_{0,\gamma_0}^{\otimes q} \otimes V_{1,\gamma_1})$-geometrically
ergodic for some $\gamma_0, \gamma_1 > 0$, with $V_{0,\gamma_0}$ and $V_{1,\gamma_1}$ defined in (3.6) and (3.3).
Applying the results of [33, Theorem 17.4.2], we obtain a functional central limit theorem
(FCLT) in discrete time for the chain Q. To obtain a result in (physical) continuous time,
we actually need a functional CLT for the kernel Q defined in Section 2.3. Hence we must
first check that Q is geometrically ergodic.

Let us define $V_{2,\gamma} : \mathbb{R}_+ \to \mathbb{R}_+$ by

\[ V_{2,\gamma}(\delta) = e^{\gamma \delta} . \tag{3.12} \]

We provide the ergodicity property of Q in the following corollary. 

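Corollary 3.8. Let $q, p \ge 1$ and Q be the kernel defined in Section 2.3 on the space
$\mathbb{R}_+ \times \{0, 1, \dots, p\} \times \mathbb{Z}_+^q \times \mathbb{R}_+^p$. Suppose that the assumptions of Theorem 3.7 hold. Then,
for all $\gamma_1 > 0$ small enough, there exists $\gamma_0^* > 0$ such that for all $\gamma_0 \in (0, \gamma_0^*]$ and all
$\gamma_2 > 0$ small enough, Q is $(V_{2,\gamma_2} \otimes \mathbb{1}_{\{0, \dots, p\}} \otimes V_{0,\gamma_0}^{\otimes q} \otimes V_{1,\gamma_1})$-geometrically ergodic.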
The proof is postponed to Section 5.6.

By Corollary 3.8, Q admits a stationary distribution $\bar\pi$. We shall use the notation $\bar{\mathbb{E}}$
for the expectation under the stationary distribution.
Let now $g : \mathsf{Z} \to \mathbb{R}$ be such that for all $\gamma_0, \gamma_1, \gamma_2 > 0$,

\[ \sup_{z \in \mathsf{Z}} \frac{|g(z)|}{[V_{2,\gamma_2} \otimes \mathbb{1}_{\{0, \dots, p\}} \otimes V_{0,\gamma_0}^{\otimes q} \otimes V_{1,\gamma_1}](z)} < \infty . \tag{3.13} \]

By Corollary 3.8, under this condition, we have $\bar\pi(|g|) < \infty$. Let $\bar g = g - \bar\pi(g)$. Using
Corollary 3.8 and [33, Theorem 17.4.4], for all $\gamma_0, \gamma_1, \gamma_2 > 0$, there exists $R > 0$ such that
the Poisson equation $\hat g - Q(\hat g) = \bar g$ (see [33, Chapter 17]) admits a solution $\hat g$ satisfying the
bound $|\hat g| \le R\, ((V_{2,\gamma_2} \otimes \mathbb{1}_{\{0, \dots, p\}} \otimes V_{0,\gamma_0}^{\otimes q} \otimes V_{1,\gamma_1}) + 1)$. Moreover, we have, for all $z \in \mathsf{Z}$,

\[ \hat g(z) = \sum_{k=0}^{\infty} Q^k(z, \bar g) . \]

It also follows that $\bar\pi(|\hat g|^2) < \infty$. Let us denote the linearly interpolated partial sums of
$\bar g(Z_n)$ by

\[ s_n(t, \bar g) = \Sigma_{\lfloor nt \rfloor}(\bar g) + (nt - \lfloor nt \rfloor) [ \Sigma_{\lfloor nt \rfloor + 1}(\bar g) - \Sigma_{\lfloor nt \rfloor}(\bar g) ] , \]

where $\Sigma_k(h) = \sum_{j=1}^{k} h(Z_j)$. Applying [33, Theorem 17.4.4], if the (nonnegative) constant

\[ \sigma_g^2 := \bar\pi( \hat g^2 - \{Q \hat g\}^2 ) \tag{3.14} \]

is strictly positive, we have, for any initial distribution, as $n \to \infty$,

\[ (n \sigma_g^2)^{-1/2} s_n(t, \bar g) \xrightarrow{d} B_t , \tag{3.15} \]

where $\xrightarrow{d}$ denotes the weak convergence in C([0,1]) (the space of continuous functions
defined on [0, 1]) and B denotes a standard Brownian motion on [0, 1].

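Remark 6. Note that $\sigma_g^2$ may not be strictly positive. An interesting example is given by
$g = \mathbb{1}_{\mathbb{R}_+} \otimes J_0 \otimes \mathbb{1}_{\mathbb{Z}_+^q} \otimes \mathbb{1}_{\mathbb{R}_+^p}$. In this case, we have $\Sigma_n(g) = S_n - S_0$ and thus

\[ \Sigma_n(\bar g) = S_n - S_0 - \bar{\mathbb{E}}(S_n) + \bar{\mathbb{E}}(S_0) + \sum_{k=1}^{n} [ \bar{\mathbb{E}}(J_0(I_k)) - \bar\pi(g) ] . \]

Hence, we get that $\Sigma_n(\bar g) = O_P(1)$, which implies $\sigma_g^2 = 0$.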

We now derive the main result of this section, which determines the scaling limit of 
the constrained Hawkes process. We need some additional notation before stating the 
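result. Let $w : \{1, \dots, p\} \to \mathbb{R}$. Define

\[ \mathcal{E}(w) = \frac{\bar{\mathbb{E}}[w_0(I_1)]}{\bar{\mathbb{E}}[\Delta_1]} . \tag{3.16} \]

Further define

\[ v(w) = \sigma_g^2 , \tag{3.17} \]

where $\sigma_g^2$ is defined as in (3.14) with $g : \mathsf{Z} \to \mathbb{R}$ defined by $g(z) = w_0(i) - \mathcal{E}(w)\, \delta$ for all
$z = (\delta, i, s, \ell)$, $w_0$ being the extension of w to $\{0, \dots, p\}$ with $w_0(0) = 0$.

Theorem 3.9. Under the assumptions of Theorem 3.7, if v(w) > 0, we have, for any
initial condition,

\[ T^{-1/2} \big( N(\mathbb{1}_{[0,tT]} \otimes w) - tT\, \mathcal{E}(w) \big) \xrightarrow{d} \big( v(w) / \bar{\mathbb{E}}[\Delta_1] \big)^{1/2} B_t . \tag{3.18} \]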
The proof is postponed to Section 5.7. 
4 Application to a limit order book (LOB)

Let us apply our results in the Markov setting to a simple limit order book where only
the two best limits are considered (Best Bid and Best Ask). Figure 1 presents a snapshot
of the price evolution in physical time and event time.




Figure 1: Limit order book of Orange : Evolution of Best Bid and Best Ask prices over 
a short time window of April 1, 2011. The evolution is displayed in physical time (left) 
and in event time (right). 

In this example, for the sake of simplicity, we focus on the following four best limit events:

• Event 1: Best Ask price moves upward one tick,

• Event 2: Best Ask price moves downward one tick,

• Event 3: Best Bid price moves upward one tick,

• Event 4: Best Bid price moves downward one tick.

In order to forecast future events based on past observations of the LOB, Hawkes processes 
seem to be a sensible approach to model the time evolution of these events. However, since 
the Best Bid has to remain at least one tick below the Best Ask, we need to introduce a 
constraint. Namely defining the spread as S = Best Ask — BestBid (in ticks), events 2 
and 3 do not happen whenever S = 1. 

Hence we propose the constrained multivariate Hawkes process described in Section 2.1 
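to model this data. This case corresponds to p = 4, q = 1, and the constrained sets $A_i$ are
defined by

$$A_i = \begin{cases} \emptyset, & i = 1 \text{ or } 4, \\ \{1\}, & i = 2 \text{ or } 3. \end{cases} \qquad (4.1)$$

The spread variable S evolves according to (2.2) with a scalar J defined by

$$J(i) = \begin{cases} 1, & i = 1 \text{ or } 4, \\ -1, & i = 2 \text{ or } 3. \end{cases} \qquad (4.2)$$

The Markov assumption of Section 2.3 allows us to use a finite set of parameters for this
model, namely $\beta$, $H$ and $\boldsymbol{\mu}_0$.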
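The four-event model can be simulated directly. The following Python sketch is only an illustration under stated assumptions, not the authors' code: it assumes that the intensity of event i at the current time equals its baseline rate plus an excitation term when the event is allowed, and 0 when the spread lies in the constrained set of (4.1); that the excitation vector decays at rate beta between events; and that it jumps by beta times the column of H indexed by the event that occurred. The neutral mark 0 of the underlying chain is ignored. The function name `simulate_lob`, the thinning scheme and the numerical values are hypothetical choices made for the example.

```python
import math
import random

# Spread increments J(i) from (4.2) and forbidden spread values A_i from (4.1).
J = {1: 1, 2: -1, 3: -1, 4: 1}
A = {1: set(), 2: {1}, 3: {1}, 4: set()}


def simulate_lob(mu0, H, beta, s0, t_max, seed=0):
    """Simulate the four-event constrained process by thinning (illustrative).

    mu0: list of 4 baseline intensities; H: 4x4 nested list (fertility matrix);
    beta: exponential decay rate; s0: initial spread in ticks (>= 1).
    Returns a list of (time, event, spread_after_event).
    """
    rng = random.Random(seed)
    p = 4
    ell = [0.0] * p          # excitation part of the intensities
    t, s = 0.0, s0
    path = []
    while True:
        # Intensities only decrease between events, so this is a valid upper bound.
        lam_bar = sum(mu0) + sum(ell)
        dt = rng.expovariate(lam_bar)
        t += dt
        if t > t_max:
            return path
        decay = math.exp(-beta * dt)
        ell = [x * decay for x in ell]
        # Event i has zero intensity when the spread lies in its constrained set.
        lam = [0.0 if s in A[i + 1] else mu0[i] + ell[i] for i in range(p)]
        u = rng.random() * lam_bar
        acc = 0.0
        for i in range(p):
            acc += lam[i]
            if u < acc:                      # candidate accepted as event i + 1
                s += J[i + 1]
                ell = [ell[k] + beta * H[k][i] for k in range(p)]
                path.append((t, i + 1, s))
                break
        # Otherwise the candidate time is rejected and the loop continues.


if __name__ == "__main__":
    H = [[0.1] * 4 for _ in range(4)]        # assumed values, spectral radius 0.4
    events = simulate_lob([0.5] * 4, H, beta=2.0, s0=2, t_max=200.0)
    print(len(events), min(s for _, _, s in events))
```

Because events 2 and 3 receive zero intensity whenever the spread equals 1, the simulated spread never drops below one tick, which is the constraint the model is designed to enforce.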
Assumptions 1 and 3 then amount to assumptions on the parameters $\boldsymbol{\mu}_0$ and $H$. Assumption 2
says that $H$ has to be invertible and requires an additional property which
only depends on the constrained sets $A_i$ and $J$. Hence, to apply Theorem 3.5, we only
need to check that this additional property holds. We set $s_0 = 2$. For any $K \ge 1$, we set
$m = |K - 2| + p$. Then, given $s \in \{1, \dots, K\}$, we need to show that there is an admissible
path $(j_1, \dots, j_m) \in \mathcal{A}_m(s)$ such that $s + \sum_{k=1}^{m} J(j_k) = 2\ (= s_0)$. To see why, let us set, for
$1 \le i \le |s - 2|$,

$$j_i = \begin{cases} 2 & \text{if } s > 2, \\ 1 & \text{if } s = 1. \end{cases}$$

Then, by (4.2), we have $s + \sum_{k=1}^{|s-2|} J(j_k) = 2$. To conclude, we set $j_i = 0$ for $|s - 2| < i \le m - 4$
and $j_{m-3} = 1$, $j_{m-2} = 4$, $j_{m-1} = 2$ and $j_m = 3$, so that, by (4.2) and (2.8), $s + \sum_{k=1}^{m} J(j_k) = 2$.
Moreover, we easily check that (3.1) holds for this choice of $(j_1, \dots, j_m)$ and that
$\{j_{m-3}, j_{m-2}, j_{m-1}, j_m\} = \{1, \dots, 4\}$. Hence Assumption 2 holds and Theorem 3.5 applies
provided that $H$ is invertible and has spectral radius smaller than 1, and $\boldsymbol{\mu}_0$ satisfies (3.7)
with $J$ defined by (4.2). The ergodicity of the underlying chain allows one to perform
meaningful statistical analysis of the data. This will be done in a forthcoming paper.
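The path construction used in this argument can be checked mechanically. The short Python sketch below builds a path of length $|K - 2| + p$ for given s and K, then verifies that it never triggers a forbidden event, ends at spread 2, and finishes by visiting each of the four events once. It assumes that mark 0 leaves the spread unchanged and is always allowed. The helper names `build_path`, `is_admissible` and `end_spread` are hypothetical, and using events 2 and 1 in the first phase is one possible choice among the spread-decreasing and spread-increasing events.

```python
# Spread increments, with the convention J(0) = 0 assumed for the neutral mark 0.
J = {0: 0, 1: 1, 2: -1, 3: -1, 4: 1}
# Events 2 and 3 are forbidden when the spread equals 1, as in (4.1).
FORBIDDEN = {0: set(), 1: set(), 2: {1}, 3: {1}, 4: set()}


def build_path(s, K, p=4):
    """Return a path of length m = |K - 2| + p driving the spread from s to 2."""
    m = abs(K - 2) + p
    first = [2 if s > 2 else 1] * abs(s - 2)      # move the spread to 2
    if len(first) > m - p:
        # The padding below needs |s - 2| <= |K - 2|.
        raise ValueError("path too short for this pair (s, K)")
    padding = [0] * (m - p - len(first))          # neutral marks
    return first + padding + [1, 4, 2, 3]         # visit every event once


def is_admissible(path, s):
    """Check that no event of the path occurs while it is forbidden."""
    for j in path:
        if s in FORBIDDEN[j]:
            return False
        s += J[j]
    return True


def end_spread(path, s):
    """Spread reached after applying all the events of the path."""
    return s + sum(J[j] for j in path)


if __name__ == "__main__":
    K = 6
    for s in range(1, K + 1):
        path = build_path(s, K)
        print(s, path, is_admissible(path, s), end_spread(path, s))
```

The final block 1, 4, 2, 3 starts and ends at spread 2, so it is admissible and leaves the end point unchanged while covering all four event types.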

For the moment, we focus on the scaling limit of the mid-price, defined as the middle 
price between the Best Bid and Best Ask prices. This value is often considered as the 
continuous time price of the asset. In the framework of our model, the mid price satisfies 
the following equation 

$P(t) = P(0) + N(\mathbb{1}_{[0,t]} \otimes w)$ , (4.3)

where w(i) takes values 1/2, -1/2, 1/2 and -1/2 for i = 1, . . . , 4, respectively. (Note that
similar equations hold for the Best Bid and Best Ask prices with different functions w.)
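Equation (4.3) says that each event moves the mid-price by half a tick. As a minimal Python sketch, with an illustrative event sequence and hypothetical helper names, the code below checks that accumulating the weights w reproduces the midpoint of the Best Bid and Best Ask obtained by tracking the two prices separately. The per-event moves of the bid and the ask are read off the event definitions of this section.

```python
from fractions import Fraction

HALF = Fraction(1, 2)
# Mid-price weights w(i) for events 1..4, as in (4.3).
W_MID = {1: HALF, 2: -HALF, 3: HALF, 4: -HALF}
# Moves of the Best Ask and Best Bid (in ticks) for each event.
W_ASK = {1: 1, 2: -1, 3: 0, 4: 0}
W_BID = {1: 0, 2: 0, 3: 1, 4: -1}


def mid_from_events(p0, events):
    """Mid-price after the given events, starting from mid-price p0."""
    return p0 + sum(W_MID[e] for e in events)


def mid_from_quotes(ask0, bid0, events):
    """Same quantity computed from the Best Ask and Best Bid separately."""
    ask = ask0 + sum(W_ASK[e] for e in events)
    bid = bid0 + sum(W_BID[e] for e in events)
    return Fraction(ask + bid, 2)


if __name__ == "__main__":
    events = [1, 4, 2, 3, 1, 1, 2]          # illustrative sequence, spread stays >= 1
    ask0, bid0 = 102, 100
    print(mid_from_events(Fraction(ask0 + bid0, 2), events))
    print(mid_from_quotes(ask0, bid0, events))
```

Exact fractions are used so that the two computations can be compared without rounding.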

Applying Theorem 3.9, we get that the scaling limit of the mid-price in physical time is
given by

$T^{-1/2}\,\bigl(P(tT) - P(0) - tT\,E(w)\bigr) \Rightarrow \bigl(v(w)/\mathbb{E}[\Delta_1]\bigr)^{1/2} B_t$ ,

where $E(w)$ and $v(w)$ are respectively defined in (3.16) and (3.17), provided that $v(w) > 0$.
Although it does not seem easy to check that $v(w) > 0$ (except perhaps by numerical
means), we do expect this to be true. Indeed, while the spread behaves as a stationary
variable, yielding a vanishing asymptotic variance in the large scale behavior (see
Remark 6), the best-bid, best-ask and mid prices behave as cointegrated random walks.

5 Postponed proofs 

5.1 Proof of Theorem 3.1 

In the following we shall denote 

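$\overline{\mu_0} = \max_j \boldsymbol{\mu}_0(j)$, $\underline{\mu_0} = \min_j \boldsymbol{\mu}_0(j)$, (5.1)

$\overline{\alpha} = \max_{i,j} \alpha_{i,j}$, $\underline{\alpha} = \min_{i,j} \alpha_{i,j}$, (5.2)

$\overline{J} = \max_{i=1,\dots,p} \sum_{j=1}^{q} |J_j(i)|$, (5.3)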
and the j-th column vector of $H$ by $H_j$, so that $H = (H_1 \dots H_p)$. To establish the
existence of small sets, we need two preliminary results.

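Lemma 5.1. Let $x = (s, \ell) \in X$, $j \in \mathcal{A}_1(s)$ with $j \ne 0$ and $g : X \to \mathbb{R}_+$. Then

$\mathbb{E}_x\bigl[g(\lambda_1)\mathbb{1}_{\{I_1 = j\}}\bigr] \ge \underline{\mu_0} \int_0^\infty g\bigl(e^{-t\beta}\ell + \beta H_j\bigr)\, e^{-2(\overline{\mu_0} + \overline{\ell})t}\, dt$ , (5.4)

where $\overline{\ell} = \max_i \ell(i)$.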

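Proof. Using (2.9), we get that

$\mathbb{E}_x\bigl[g(\lambda_1)\mathbb{1}_{\{I_1=j\}}\bigr] = \mathbb{E}_x\bigl[g(e^{-\beta\Delta_1^j}\ell + \beta H_j)\,\mathbb{1}_{\{\Delta_1^j < U_\ell\}}\bigr]$ ,

where $U_\ell = \min_{i \ne j} \Delta_1^i$.

Observe that $U_\ell \ge \mathrm{Exp}(\overline{\mu_0} + \overline{\ell})$, where $\mathrm{Exp}(z)$ denotes the exponential distribution
with mean $1/z$; thus, since $\Delta_1^j$ and $U_\ell$ are independent, we have

$\mathbb{E}_x\bigl[g(\lambda_1)\mathbb{1}_{\{I_1=j\}}\bigr] \ge \mathbb{E}_x\bigl[g(e^{-\beta\Delta_1^j}\ell + \beta H_j)\,e^{-(\overline{\mu_0}+\overline{\ell})\Delta_1^j}\bigr]$ .

Since $j \in \mathcal{A}_1(s)$, using that the hazard rate of $\Delta_1^j$ is bounded between $\underline{\mu_0}$ and $\overline{\mu_0} + \overline{\ell}$, we
see that the density of $\Delta_1^j$ is bounded from below by $\underline{\mu_0}\,e^{-(\overline{\mu_0}+\overline{\ell})t}$, $t \in \mathbb{R}_+$. Hence we
get (5.4). □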

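Corollary 5.2. Let $m$ be an integer $\ge 1$, $M \in \mathbb{R}_+$ and $g : X \to \mathbb{R}_+$. Let $x = (s, \ell) \in X$
with $\ell \in (0, M]^p$ and $(j_1, \dots, j_m) \in \mathcal{A}_m(s)$ with $j_i \ne 0$ for $i = 1, \dots, m$. Then, we have

$\mathbb{E}_x\bigl[g(\lambda_m)\mathbb{1}_{\{I_1 = j_1, \dots, I_m = j_m\}}\bigr]$

$\ge \underline{\mu_0}^{\,m} \int_{\mathbb{R}_+^m} g\bigl(e^{-\sum_{i=1}^{m} t_i \beta}\ell + e^{-\sum_{i=2}^{m} t_i \beta}\beta H_{j_1} + \dots + \beta H_{j_m}\bigr)$

$\times\, e^{-C_{m,M} \sum_{i=1}^{m} t_i}\, dt_m \cdots dt_1$ ,

where $C_{m,M} = 2(\overline{\mu_0} + M + (m - 1)\beta\overline{\alpha})$.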

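Proof. The case m = 1 is given by Lemma 5.1.

To get the result for any $m \ge 2$, we proceed by induction. Let $x = (s, \ell) \in X$ with
$\ell \in (0, M]^p$ and $(j_1, \dots, j_m) \in \{1, \dots, p\}^m$ such that $(j_1, \dots, j_m) \in \mathcal{A}_m(s)$. We have

$\mathbb{E}_x\bigl[g(\lambda_m)\mathbb{1}_{\{I_1=j_1,\dots,I_m=j_m\}}\bigr]$

$= \mathbb{E}_x\bigl[\mathbb{1}_{\{I_1=j_1\}}\, \mathbb{E}[g(\lambda_m)\mathbb{1}_{\{I_2=j_2,\dots,I_m=j_m\}} \mid (I_1, X_1)]\bigr]$ . (5.5)

Recall that given $(I_1, X_1)$, the distribution of $\{(I_k, X_k),\ k \ge 2\}$ does not depend on $I_1$,
hence, with the Markov property, we obtain that

$\mathbb{E}[g(\lambda_m)\mathbb{1}_{\{I_2=j_2,\dots,I_m=j_m\}} \mid (I_1, X_1)] = \mathbb{E}_{X_1}[g(\lambda_{m-1})\mathbb{1}_{\{I_1=j_2,\dots,I_{m-1}=j_m\}}]$ .

Note that, using (2.9) and (5.2), $\{X_0 = x\}$ and $\ell \in (0, M]^p$ implies $\lambda_1 \in (0, M + \beta\overline{\alpha}]^p$.
Thus, in the event $\{X_0 = x\}$, the right-hand side of the last display can be bounded from
below by applying the induction hypothesis with $(j_1, \dots, j_{m-1})$ replaced by $(j_2, \dots, j_m)$,
$\ell = \lambda_1$ and the constant

$C_{m-1,\,M+\beta\overline{\alpha}} = 2(\overline{\mu_0} + M + \beta\overline{\alpha} + (m - 2)\beta\overline{\alpha}) = C_{m,M}$ .

Plugging this bound in (5.5), we get that

$\mathbb{E}_x\bigl[g(\lambda_m)\mathbb{1}_{\{I_1=j_1,\dots,I_m=j_m\}}\bigr] \ge \underline{\mu_0}^{\,m-1} \int_{\mathbb{R}_+^{m-1}} \mathbb{E}_x\bigl[\mathbb{1}_{\{I_1=j_1\}}\, g[\dots]\bigr]\, e^{-C_{m,M}\sum_{i=2}^{m} t_i}\, dt_m \cdots dt_2$ , (5.6)

where the omitted argument of $g$ reads

$[\dots] = e^{-\sum_{i=2}^{m} t_i\beta}\lambda_1 + e^{-\sum_{i=3}^{m} t_i\beta}\beta H_{j_2} + \dots + \beta H_{j_m}$ .

Using again Lemma 5.1, we have

$\mathbb{E}_x\bigl[\mathbb{1}_{\{I_1=j_1\}}\, g[\dots]\bigr] \ge \underline{\mu_0}\int_0^\infty g[\dots']\, e^{-2(\overline{\mu_0}+\overline{\ell})t_1}\, dt_1 \ge \underline{\mu_0}\int_0^\infty g[\dots']\, e^{-C_{m,M} t_1}\, dt_1$ , (5.7)

where the omitted argument in the right-hand side reads

$[\dots'] = e^{-\sum_{i=2}^{m} t_i\beta}\bigl(e^{-t_1\beta}\ell + \beta H_{j_1}\bigr) + e^{-\sum_{i=3}^{m} t_i\beta}\beta H_{j_2} + \dots + \beta H_{j_m}$
$= e^{-\sum_{i=1}^{m} t_i\beta}\ell + e^{-\sum_{i=2}^{m} t_i\beta}\beta H_{j_1} + \dots + \beta H_{j_m}$ .

Hence we get the result by applying the Tonelli theorem in (5.6) and then using (5.7). □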

Proposition 5.3. Let $Q$ be the transition kernel defined in Section 2.3 on the space
$\mathbb{Z}_+^q \times \mathbb{R}_+^p$ with $p, q \ge 1$. Suppose that Assumption 2 holds. Then there exists a probability
measure $\nu$ on $X$ such that, for all $K \ge 1$ and $M > 0$, there exist $\epsilon > 0$ and a positive
integer $m$ such that $\{1, \dots, K\}^q \times (0, M]^p$ is an $(m, \epsilon, \nu)$ and $(m + 1, \epsilon, \nu)$-small set for $Q$.

Before providing the proof of this result, let us state the following corollary which
follows by observing that the conditional distribution of $(I_1, S_1, \lambda_1)$ given $(I_0, S_0, \lambda_0)$
does not depend on $I_0$.

Corollary 5.4. Let $Q$ be the transition kernel defined in Section 2.3 on the space
$\{0, 1, \dots, p\} \times \mathbb{Z}_+^q \times \mathbb{R}_+^p$ with $p, q \ge 1$. Suppose that Assumption 2 holds. Then there exists a probability
measure $\nu$ on $X$ such that, for all $K \ge 1$ and $M > 0$, there exist $\epsilon > 0$ and a positive
integer $m$ such that $\{1, \dots, p\} \times \{1, \dots, K\}^q \times (0, M]^p$ is an $(m, \epsilon, \nu)$ and $(m + 1, \epsilon, \nu)$-small
set for $Q$.

Proof of Proposition 5.3. Let $K \ge 1$ and $M > 0$ and denote $C = \{1, \dots, K\}^q \times (0, M]^p$.
Let $x = (s, \ell) \in C$. By Assumption 2, there exist an integer $m > 0$ and an admissible
path $(j_1, \dots, j_m)$ such that $s + \sum_{n=1}^{m} J(j_n) = s_0$ and $\{j_{m-p+1}, \dots, j_m\} = \{1, \dots, p\}$, where
$s_0 \in \mathbb{Z}_+^q$ is independent of $m$, $M$ and $K$.
Now, for all subsets $A \in \mathcal{B}(\mathbb{Z}_+^q)$ and $B \in \mathcal{B}(\mathbb{R}_+^p)$, we may write

$Q^m(x, A \times B) = \mathbb{E}_x[\mathbb{1}_A(S_m)\mathbb{1}_B(\lambda_m)]$



> E x 



l A {S m )l B {X m )Y[l {u 



■-ii) 



1a(s d )E x 



771 — p 



l B (Ap)IJl 



(5.8) 






where we used that, given $Y_i$, $i = 0, 1, \dots, m - p$, the conditional distribution of $Y_j$,
$j > m - p$ only depends on $X_{m-p}$ and the Markov property.

Take now $K \ge 1$ and $M > 0$ and let $x \in \{1, \dots, K\}^q \times (0, M]^p$. By (2.9) and (5.2),
under $\{X_0 = x\}$, we have $\lambda_{m-p} \in (0, M']^p$ where $M' = M + \overline{\alpha}\beta(m - p)$. Moreover,
under $\{X_0 = x,\ I_i = j_i,\ i = 1, \dots, m - p\}$, the last $p$-steps path $(j_{m-p+1}, \dots, j_m)$ is
admissible, that is, $(j_{m-p+1}, \dots, j_m) \in \mathcal{A}_p(S_{m-p})$. Applying Corollary 5.2, we get that,
under $\{X_0 = x,\ I_i = j_i,\ i = 1, \dots, m - p\}$,



is(A m ) n i {Ii=ii 

i— m— p+l 



> Mo p / Is (e- Sf=i **** + e" Sf= a ti P/3'tij 1 +■■■+ /3H Jp ) 

x e - c ™,M' Ef=i **di p ■ ■ • dti , (5.9) 



where C m , M ' = 2(/^+M' + (m- l)a). Setting u = e^^-i*', i? = [i9(l), • • • ,tf(p-l)] T 
where i9(j) = c~ ^=j+i ti/3 for j = — 1, Inequality (5.9) yields, under {X = 

x,Ij =ii, i = l,...,m-p}, 



ib(a to ) n i { i j=i , 

z— m— p+1 



>^o p —— / 1 b (m£ + /3Hi9) u^^dtfdw, (5.10) 

where Z? = {(u, ■&) : < u < i9(l) < • • ■ < i9(p — 1) < 1}. Applying Lemma A.l with 
C 1 

r = /3H and 7 = — ^ — in (5.10), we get that there exists ei > and a probability 
measure V\ such that, under {Xo = x, Ij = j», i = 1, . . . , m — p}, 



Is (Am) II X KU=U 
i=m—p-\-l 



(5.11) 



Moreover, it is important to note that $\nu_1$ only depends on $\beta$ and $H$.
Using Inequalities (5.8) and (5.11), we get that



Q m (x, A x B) > eil A (s ) ^i(B)E x 



'm—p 

n 



(5.12) 



Using that x G {1, • • • , /C} 9 x (0, M] p and Corollary 5.2 with <? = 1, we further have that 

~m—p 



> 



Mo 



From Inequality (5.12), we get that $Q^m(x, A \times B) \ge \epsilon\,\nu(A \times B)$ for some positive constant
$\epsilon$ not depending on $x$, where $\nu$ is the measure on $X$ defined by $\nu(A \times B) = \mathbb{1}_A(s_0)\,\nu_1(B)$,
which depends neither on $M$ nor on $K$. In other words, $C$ is an $(m, \epsilon, \nu)$-small set (see
[33]).

To obtain that $C$ is also an $(m + 1, \epsilon, \nu)$-small set (by possibly decreasing $\epsilon$), we simply
observe that we can carry out the same proof as above with $m$ replaced by $m + 1$ and the
sequence of marks $(j_1, \dots, j_m)$ replaced by $(0, j_1, \dots, j_m)$. □
Proof of Theorem 3.1. In Proposition 5.3, the measure $\nu$ does not depend on the set $C$,
which can be chosen to contain any arbitrary point $x$ in the state space $X$. Hence the chain
is $\phi$-irreducible with $\phi = \nu$. Moreover the set $C$ is $(m, \epsilon, \nu)$-small and $(m + 1, \epsilon, \nu)$-small;
hence, the chain is aperiodic, see [33, Section 5.4.3]. These arguments hold for the kernel
$Q$ and thus we obtain Theorem 3.1. □



5.2 Proof of Proposition 3.2 

Recall the definition of $u$ in (2.4). We shall further denote

$1 \le \underline{u} = \min_{i=1,\dots,p} u(i) \le \overline{u} = \max_{i=1,\dots,p} u(i) < \infty$ . (5.13)

The inequality $\underline{u} \ge 1$ follows from the fact that $H^k$ has non-negative entries for all $k \ge 1$.

Observe that, given the initial condition x = (s,£) £ 1 C \_ x , the hazard rate of 
Ai = min(A I 1 , % = 0, . . . ,p) reads 

p 

Hr(«) = ^Hr,( u ) = fi + 1> + l^e"^ , 

where we used (2.6). Moreover, for any i £ {0, l,--- ,p}, we get that, given Ai, the 
conditional probability that the first arrival's mark is i is given by 



P x (7 1 =i|A 1 )=Hr j (A 1 )/Hr(A 1 ). 
Using (2.9), we have X 1 = £c~ f:SAl + /3N ri l{j I ^ }, and thus 
[Q(l z « <8> Vi l7 )](x) 
= E x [cxp ( 7 u T fe^ Al +7/3u T N /l l {/l#0 })] 

exp ( 7 u T &T^ Al ) f> x (/i = i\Ai)e*^-D 



(5.14) 



i=0 



by conditioning on Ai and using that H u = u — l p (see (2.4)) and setting u(0) = 1. 
Inserting Formula (5.14), we get 



[Q(l zl ®I/ li7 )](x)=E x 



j>xp ( 7 u T £e^ Al + 7 0(u(i) - l))Hr i (A 1 )/Hr(A 1 ) 



Using the hazard rate of Ai under P x , the corresponding probability density function is 
given by u i — y Hr(it)e~ Ir ("' on R + , where 

In(t) = f &»(«) du, i > , 
Jo 

Ir(t) = £ Ir*(i) = fit + l> i + ^(1 - e"^) 

Hence we get 

[<9(%®*Vy)](x) = 



(5.15) 



a yvi r te~ p% '-Jx{u) 



^ c 7/3(uW-l) Hr . (l 



i=0 



du 



Using that e^ ) -15 < e 7/3(u(t) - 1) for all i = 1, . . . ,p and the particular form of Hr(0) 
and Hr(i) in (2.6), we easily obtain that 



^c^( u «-%r,( M )< 



i=0 



i=l 
Consequently, we have
1 



■[Q(l zi ®Fi, 7 )](x) 



+ f £ e-r«-(0-^(,-)j ^r^y ^ e^ e ^- Ir W-^d U . (5.16) 



Using (5.15), we have 



/ e7 u r *c-^-Ir(u) du = / e ( 7 u r £+^-)(e-^-l)-( M g+ljMo)« du 



Jo 



->0 as i; £ oo , (5.17) 



where we used Lemma A. 2 with a = ju T £ H — p — and /3' = (j,q + lpHo- 
Similarly, applying Lemma A. 2 with j3' = + fi® + iJa*o, we obtain 



1 



Vi„W Jo 



e 7u r &- f! "-Ir( tl )-«/3 dl 



= J™ exp ^( 7 u T ^ + ^)(e" /3u - 1) - 03 + Mo + lp Mo)«j d« 

Therefore, inserting (5.17) and (5.18) in (5.16), we get 

Hmsup— — r[Q(l z . ®V 1)7 )](x)< sup U . 

Now observe that, using (5.13), we have 

< u < u T £/l^£ < u< oo , 

and thus, for 7 > small enough, using a Taylor expansion of the exponential function 
at the origin, we have 

EjUc^W- 1 ^) lj£ + 7/3(u T €-lj£) + C 7 2 l^ 
/3 7 u T ^ + 1 J£ " /3 7 u T £ + 1 It 

< 1 - ^y^^ + C" 7 2 - 1 - /?7 + 0( 7 2 ) , 

where C, C' do not depend on I. Hence taking 71 > small enough, we have for all 
76 (0,7iL 

yp <y/9(u(t)-l) pu\ 

Hence choosing such a $\gamma$ and setting $\theta = \theta(\gamma)$, we get (3.2) for $\ell$ out of $(0, M]^p$ provided
that $M$ is large enough. Since $V_{1,\gamma}$ is bounded on $(0, M]^p$, this concludes the proof.
5.3 Proof of Corollary 3.4

By [5, Theorem (1)], using the definitions and notation of Section 2.1, we have

$\lim_{T \to \infty} \frac{1}{T}\, N'(\mathbb{1}_{[0,T]} \otimes w) = w^T (\mathrm{Id}_p - H)^{-1} \boldsymbol{\mu}_0$ a.s. (5.19)

On the other hand, by the law of large numbers for Harris recurrent chains (see [33,
Theorem 17.1.7]), we have

$\lim_{n \to \infty} \frac{1}{n} \sum_{k=1}^{n} w(I_k) = \pi(w \otimes \mathbb{1}_{\mathbb{R}_+^p})$ a.s. $[P_*]$ . (5.20)

Here $[P_*]$ means that the result holds for any initial distribution on $(I_0, \lambda_0)$. In particular,
taking $\lambda_0 = 0$, we get that

$\lim_{n \to \infty} \frac{1}{n}\, N'(\mathbb{1}_{[0,T_n]} \otimes w) = \pi(w \otimes \mathbb{1}_{\mathbb{R}_+^p})$ , (5.21)

where $T_n$ is defined as $T_n$ in Section 2.3 but for the unconstrained chain. Note that we
used that $w(0) = 0$, so that only the marks in $\{1, \dots, p\}$ are counted in the sum of the
left-hand side of (5.20). Observe now that under the event $\{\lambda_0 = 0\}$, we have

$n_0([0, T_n]) + N'(\mathbb{1}_{[0,T_n]} \otimes \mathbb{1}_{\{1,\dots,p\}}) = n$ .

(Recall that $n_0$ is an independent homogeneous PPP which corresponds to the events with
marks equal to 0.) Since $(T_n)$ is an increasing sequence, we get that it diverges a.s. and
thus, applying (5.19) with $w = \mathbb{1}_{\{1,\dots,p\}}$, we have, under the event $\{\lambda_0 = 0\}$,

$\lim_{n \to \infty} \frac{1}{T_n}\, N'(\mathbb{1}_{[0,T_n]} \otimes \mathbb{1}_{\{1,\dots,p\}}) = \mathbb{1}_p^T (\mathrm{Id}_p - H)^{-1} \boldsymbol{\mu}_0$ a.s.

The last two displays yield, under the event $\{\lambda_0 = 0\}$,

$\lim_{n \to \infty} \frac{T_n}{n} = \frac{1}{\mu_0 + \mathbb{1}_p^T (\mathrm{Id}_p - H)^{-1} \boldsymbol{\mu}_0}$ a.s.

Applying (5.19), (5.21) and the last display, we get (3.4).
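The limit in (5.19) is the usual long-run event rate of an unconstrained Hawkes process: a baseline vector amplified by the inverse of the identity minus the fertility matrix. As a small numerical illustration, the Python sketch below solves the corresponding linear system by Gaussian elimination and evaluates the weighted rate. The matrix and vector values are made up for the example, and the helper names `solve` and `mean_rate` are hypothetical.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]   # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][c] * x[c] for c in range(i + 1, n))) / m[i][i]
    return x


def mean_rate(H, mu0, w):
    """Return w^T (Id - H)^{-1} mu0 for a square matrix H given as nested lists."""
    p = len(mu0)
    id_minus_h = [[(1.0 if i == j else 0.0) - H[i][j] for j in range(p)]
                  for i in range(p)]
    m = solve(id_minus_h, mu0)
    return sum(wi * mi for wi, mi in zip(w, m))


if __name__ == "__main__":
    H = [[0.2, 0.1], [0.3, 0.4]]      # assumed values, spectral radius below 1
    mu0 = [1.0, 0.5]
    print(mean_rate(H, mu0, [1.0, 1.0]))
```

With the all-ones weight the result is the total rate of events with marks in {1, . . . , p}, which is the quantity that enters the limit of $T_n/n$ in the proof.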

5.4 Proof of Theorem 3.6 

The case (i) follows from Proposition 3.2. 

We next consider the case (ii) so that $\psi(\{s_2\} \times (0, \infty)^p) > 0$ for some $s_2 > s^*$.
By (3.8) there exists $i \in \{1, \dots, p\}$ such that $J(i) > 0$. Thus, for any $m \ge 1$, the
constant sequence $(i, \dots, i)$ of length $m$ belongs to $\mathcal{A}_m(s_2)$ (see Definition 1). It follows
that $\psi(\{s_2 + mJ(i)\} \times (0, \infty)^p) > 0$ for all $m \ge 1$. We shall prove that for all $s' \ge 1$ and
$M > 0$,

$\liminf_{s \to \infty}\ \inf_{\ell \in (0,M]^p} P_{(s,\ell)}\bigl(\inf_{n \ge 1} S_n > s'\bigr) > 0$ . (5.22)

Take $s'$ such that $\psi(\{s'\} \times (0, \infty)^p) > 0$. Define $s = s_2 + mJ(i)$ and take $m$ and $M > 0$ large
enough so that $\psi(\{s\} \times (0, M]^p) > 0$ and $P_{(s,\ell)}\bigl(\inf_{n \ge 1} S_n > s'\bigr) > 0$ for all $\ell \in (0, M]^p$. We
thus have that the probability that the chain $\{X_k,\ k \ge 1\}$ avoids $\{s'\} \times (0, \infty)^p$ starting
from anywhere in $\{s\} \times (0, M]^p$ is positive. Since these two sets have positive $\psi$-measure,
we get by [33, Theorem 8.3.6] that $Q$ is transient.
Hence it only remains to show that (5.22) holds. Applying Corollary 3.4 and Condition (3.8),
we have

$\lim_{n \to \infty} M_n = \infty$ a.s. $[P_*]$, with $M_n = \sum_{k=1}^{n} J(I_k)$ .

Hence $U = \inf_{n \ge 1} M_n$ is valued in $\mathbb{R}$ a.s. $[P_*]$ (it equals $-\infty$ with probability 0). Let $\epsilon \in (0, 1)$.
Now under the event $\{S_0 = s\}$, we have that $S_n = s + M_n$ as long as $S_k$ does not hit the
constraint set $\bigcup_{i=1}^{p} A_i$ for $k = 1, \dots, n$.

For all $s > s^*$, the event $\{\inf_{n \ge 0} S_n > s^*\}$ coincides with the event $\{U + s > s^*\}$. Hence,
under this latter event we have $\inf_{n \ge 1} S_n = s + U$. Finally, we get that, for all $s > s^*$ and
all $\ell \in \mathbb{R}_+^p$,

$P_{(s,\ell)}\bigl(\inf_{n \ge 1} S_n > s'\bigr) \ge P_\ell\bigl(U > (s' \vee s^*) - s\bigr)$ . (5.23)

(Recall that $U$ only depends on the unconstrained chain $(I_n, \lambda_n)$, $n \ge 0$.) Let us denote,
for all $k \ge 1$, $U_k = \inf_{n \ge k} M_n$ so that $U = M_1 \wedge \dots \wedge M_k \wedge U_{k+1}$. Since $M_1, \dots, M_k \ge -k\overline{J}$,
we have, for all $s_1 > k\overline{J}$,

$w_{s_1}(\ell) := P_\ell(U > -s_1) = P_\ell(U_{k+1} > -s_1) = [Q^k(w_{s_1})](\ell)$ .

Observe that $w_{s_1}$ is a function valued in $[0, 1]$. By Proposition 3.3 and [33, Section 15],
$Q$ admits a stationary distribution $\pi$ and there are constants $\gamma, C > 0$ and $\theta \in (0, 1)$ not
depending on $s_1$ such that, for all $\ell, \ell_0 \in \mathbb{R}_+^p$ and $k \ge 1$,

$[Q^k(w_{s_1})](\ell) \ge [Q^k(w_{s_1})](\ell_0) - C\theta^k\,\bigl(V_{1,\gamma}(\ell) + V_{1,\gamma}(\ell_0)\bigr)$ .

Now, fix some $\ell_0 \in \mathbb{R}_+^p$ and $M > 0$. Choose $k$ large enough so that

$c = C\theta^k\,\bigl(V_{1,\gamma}(M\mathbb{1}_p) + V_{1,\gamma}(\ell_0)\bigr) < 1/2$ .

Then we may choose $s_1$ large enough so that $s_1 > k\overline{J}$ and

$[Q^k(w_{s_1})](\ell_0) = P_{\ell_0}(U_{k+1} > -s_1) \ge P_{\ell_0}(U > -s_1) > 1/2$ .

The last four displays yield that, for all $\ell \in (0, M]^p$, and all $s_1$ large enough,

$P_\ell(U > -s_1) \ge 1/2 - c > 0$ .

This and (5.23) yield (5.22), which concludes the proof.

5.5 Proof of Theorem 3.7 : induction on q 

We have already shown that the chains Q and Q are $\psi$-irreducible and aperiodic and
exhibited petite sets, see Theorem 3.1. To obtain a drift condition in the constrained
case $q \ge 1$, we shall reason by induction on the number of constraints $q$. We shall rely on
the transition kernels $Q^{(-\mathcal{J})}$ introduced in Section 2.3 for subsets $\mathcal{J}$ in $\{1, \dots, q\}$. For such
a given set $\mathcal{J}$, we shall further denote by $\{(I_n, S_n, \lambda_n),\ n \ge 0\}$ the chain with transition
kernel $Q^{(-\mathcal{J})}$ which starts at the same state as $(I_0, S_0, \lambda_0)$ and, for all $x \in \mathbb{Z}_+^{q - \#\mathcal{J}} \times \mathbb{R}_+^p$,
by $\mathbb{E}^{(-\mathcal{J}),x}[\dots]$ the expectation under $\{(S_0, \lambda_0) = x\}$. (The fact that this expectation
only depends on $x$ is explained in Remark 1.)
Proposition 5.5. Let $p, q \ge 1$ and define the transition kernels $Q$ on the space $\mathbb{Z}_+^q \times \mathbb{R}_+^p$
as in Section 2.3. Suppose that $H$ is invertible and that Assumption 1 holds. For any
non-empty $\mathcal{J} \subset \{1, \dots, q\}$, let $\{(I_n, S_n, \lambda_n),\ n \ge 0\}$ be a Markov chain on the space
$\{0, \dots, p\} \times \mathbb{Z}_+^{q - \#\mathcal{J}} \times \mathbb{R}_+^p$, with transition kernel $Q^{(-\mathcal{J})}$ defined in Section 2.3, and suppose
that it satisfies the following condition.

(C) For all $\gamma_1 > 0$ small enough, there exists $\gamma_0^* > 0$ such that for all $\gamma_0 \in (0, \gamma_0^*]$,



lim 



sup 



E (-J),(M) L7o£r = iw(4V li7i (A m -£)] = , (5.24) 



where $w$ is defined in (5.42).

Then, for all $\gamma_1 > 0$ small enough, there exists $\gamma_0^* > 0$ such that for all $\gamma_0 \in (0, \gamma_0^*]$, there
exist $\theta \in (0, 1)$, $b > 0$ and $m \ge 1$, such that for any initial condition $x \in X$,



Q m Kl ® F lj7l )(x) < ® 7i m )(x) + b 

where $V_{1,\gamma}$ and $V_{0,\gamma}$ are defined in (3.3) and (3.6) respectively.



(5.25) 



Proof. Take $x = (s, \ell) \in X$. We already have the drift condition (3.2) of Proposition 3.2
(which holds for some $\gamma > 0$ and thus, by Jensen's inequality, for any $\gamma_1 \in (0, \gamma)$). By
induction, for all small enough $\gamma_1 > 0$, there exist $b_1 > 0$ and $\theta_1 > 0$, such that, for all
$m \ge 1$,

Q m (l zl ® Vl 7l )(x) < ^^(Z) + 6 X (1 + 0i + ■ ■ ■ + (9™) 



< 



Jf^W+ii/fl-ffi) . 



By (2.7) and (5.3) we have, in the event {S = s}, ' (Si) < e 7oJ V^ 9 (s) , then we get 



3f;(S m ) < c^> Jm V %(S ) . Hence, we obtain, for any 7o > 0, 

Q m {V®l ® F li7l )(x) < e^ m <* (s) [0™y li7l (£) + 6 X / (1 - 6 1 )} 



< 



(e^ 1 ) m (^ o ®F 1 , 71 )(x) + 



0l6 7oJm 
1-0! 



O 8 ) 



(5.26) 



Observe that, for a given $\theta_1 < 1$, we can choose $\gamma_0 > 0$ small enough so that $e^{\gamma_0 \overline{J}}\,\theta_1 < 1$. It
follows that, choosing such a $\gamma_0$, for all $m \ge 1$ and $K \ge 1$, the bound (5.26) implies (5.25)
for all $x = (s, \ell) \in X$ such that $V_{0,\gamma_0}^{\otimes q}(s) \le K$. To address the case $V_{0,\gamma_0}^{\otimes q}(s) > K$, we shall
prove the following assertion.

(D) For all $\gamma_1 > 0$ small enough, there exists $\gamma_0^* > 0$ such that for all $\gamma_0 \in (0, \gamma_0^*]$, there
exist $m \ge 1$, $K > 0$, $b > 0$ and $\theta \in (0, 1)$ such that (5.25) holds for all $x = (s, \ell) \in X$
satisfying $V_{0,\gamma_0}^{\otimes q}(s) > K$.

To conclude the proof, we now show that Assertion (D) holds. Let $m \ge 1$ and $x = (s, \ell) \in X$.
Observe that, by (2.7), for all $\gamma_0 > 0$, we have, in the event $\{X_0 = x\}$,



Hence we have, for all x = (s,£) G X, 

q™(v^ o ® f 1j71 )(x) = ^(s)P [c^r.wfy^^^j 

- «7o ® ^1,71 )( x ) E x [e 70 w(/fc Vi, 7l (A m - £) 



(5.27) 
Now the idea of the proof relies on the fact that if $V_{0,\gamma_0}^{\otimes q}(S_0)$ is large enough, then it will
take some time for all the components $j$ in a set $\mathcal{J}$ (with cardinality $\#\mathcal{J}$ at least 1) of the
multivariate spread to reach their constraint set



Mj) = U A <c?) 



(5.28) 



j=i, 



Up to this time the process defined by {Y^ - " 7 ' = (I n ,S^ J \\„), n > 0}, where S^ - ^ 
is obtained by removing the j-th component of S„ for all j G J , behaves as a Markov 
chain with kernel Q^~^ (defined as Q but without the j-th constraints with j G J ', see 
above). More precisely, we use the following coupling argument. For a positive integer 
to, we denote 



t (j) = 1 + mJ + max(A(j)) . 



(5.29) 



We can then define a Markov chain {(/„, S„, A n ), n > 0} with transition kernel Q^~^' 
and such that in the event {So(j') > s* m {j), j G J}, the sequence {(/„, S n , A n ), n = 
0, 1, ... , to} coincides with {(/„, S^ J \ A„), n = 0, 1, . . . , to}. Then, by construction, for 
all x = (s,£) such that s(j) > s* n (j) for all j G J , denoting x' = {s^~^\l), we have 



3 7oE ™= lW(/)e Vi j7l (A m -£) 



(5.30) 



Now, we take K > such that log/T > jryo max s* n (j). We let denote the set of all 

3=h — il 

indices j G {l,...,q} such that s(j) > s^C?)- The set J is not empty since V®^ o (s) > K 
implies max(s) > max(sj n ). Hence, Equality (5.30) applies for all x = (s,£) G X satisfying 
V®? (s) > K. For 7q as in Condition (C) (we can choose it independently of J since there 



is a finite number of such subsets), we obtain Assertion (D). 



□ 



Condition (C) of Proposition 5.5 needs to be simplified. A first step in this direction 
is given by the following lemma. 

Lemma 5.6. Let $\{(I_n, S_n, \lambda_n),\ n \ge 0\}$ be the Markov chain on the space
$\{0, 1, \dots, p\} \times \mathbb{Z}_+^q \times \mathbb{R}_+^p$ with transition kernel $Q$ defined in Section 2.3. Let $w : \{0, 1, \dots, p\} \to \mathbb{R}$.
Suppose that the following assertions hold.

(E-1) For all $\gamma' > 0$ small enough, there exists $\gamma^* > 0$ such that, for all $\gamma \in (0, \gamma^*]$, $Q$
is $(\mathbb{1}_{\{0,1,\dots,p\}} \otimes V_{0,\gamma}^{\otimes q} \otimes V_{1,\gamma'})$-geometrically ergodic, where $V_{0,\gamma}$ and $V_{1,\gamma'}$ are defined
in (3.6) and (3.3) respectively.

(E-2) The stationary distribution $\pi$ of $Q$ satisfies $\pi(w \otimes \mathbb{1}_{\mathbb{Z}_+^q \times \mathbb{R}_+^p}) < 0$.

Then, for all $\gamma_1 > 0$ small enough, there are positive constants $\gamma$, $\rho$ and $\gamma_0^*$ such that, for
all $\gamma_0 \in (0, \gamma_0^*]$,



SUp SUp ; — Li — < OO . 

xSX m>l 



(5.31) 



Proof. Using Assumption (E-l), for any 7' > small enough, we may choose 7, 71 G (0, 7*] 
such that Q is Iiq ... p} ® ® Ui i7l -geometrically ergodic. Let w = max w(i). Then, 

for any 7* > 0, there 7, 71 > small enough, we have for all j > 1 and x = (s, £) G X, 



e 7o£i =1 w(7 fe ) 8 7li7i)(Xj .)] < e^E* [(F®* ® Ui, 7l )(X J )] 

<Ce^"(Ou 1 , 7l )(x), 



(5.32) 
where, in the last inequality, we used (E-1) and thus the constant $C$ only depends on
$\gamma$, $\gamma_1$. Let now $m, l \ge 1$ and write $m = nl + j$ with $j \in \{0, \dots, l\}$. Then, using (5.32), the
Hölder inequality and (5.32) again, we get



E x 



P 7o Er=i w ( / '=) 



« 9 ®^ 7l )(X m ) 



< Ce 70U,J E x [ e T°^=i w ( /fc ) (V$® Vi i7l )(X n! ) 



= Ce w E x 



< 



Vi=l 
f I 



J To E*=o w ( / '=i+0 



(5.33) 



" W®^,7i)(X(„-i)j+i) 



< C 2 e 2 ^" E x [e^°^o w(/« +4 ) (V r®9 g, 7 li71 )(X (n _ 1)I+i ) 



(5.34) 



Applying [33, Theorem 15.0.1], Q has a unique invariant probability measure 7r and there 
exists R > and k > such that, for all i £ {0, . . . ,p}, s £ and £ £ R+ and all 
positive integer to, 



\\Q m ((i,s,£) r ) - 7r|| (7)7l) < i?e~ K ™ F ^(s) K li71 (*) , 
where for any signed measure £ on {0, . . . ,p} x x R?j_, we set 



IKII 



sup{e( ff ) : ||g/(l { o,..., P }®< 7 J 0^i, 7l )||oo < 1} 



Define 



'(i,s,^)=e^ w ( 1 Vo, 7 (s)y li7l (^) 



for all (i,s,£) £ {0,...,p} x Z« 



Then 



h7(l{o,... lP }®V f«(»Vi l71 )|L <e 



Thus, (5.35) yields that, for all (i,s,£) £ {0, . . x Z q + x R^, 



*((i,s, £),«;*) < ff(u;*) + iie" 



V®«(s)V lni (£) 



Now, by dominated convergence, we have, as a — > 0, 

exp(aw<x> l z « xR p ) - 1 
which is negative by Condition (E-2). Hence there exists a > such that 



fr I w ® Is 



(5.35) 



(5.36) 



6 := 7T ( cxp(aw ® l z a xR p ) ) < 1 



(5.37) 



Now, by dominated convergence, we may decrease 7* so that 7,71 £ (0,7*] implies 
tt(w*) < b 1 !" 1 for all I and 70 satisfying ^70 = a. Then we have for these a and 7,71, 
which set the values of R and k, and for all / and 70 satisfying ^70 = a, 

Q l ((i, S> £) )W *) < (b^ + Re-" 1 e™) V$(s) V hyi (£) . 
We now denote by $L$ some integer such that $b^{1/2} + R\,e^{-\kappa l} e^{a\overline{w}} < b^{1/4}$ for all $l \ge L$. Then,
provided that $l\gamma_0 = a$ and $l \ge L$, we get

Q l ((i,s,£),w*) Kb^V^&V^e) . 

Using this bound, by successive conditioning, we get that, for all n > 1 and all 1 < j < I, 
and x = (s,£), if Z70 — a, 

: i7 ' E « w(W) (V®« ® Vi l71 )(X (n _ 1)I+4 )) 

< fcC**- 1 )/*!" [e^owC/,) g 7 lj7i )(Xi))" 

< Ce" 1 fe^" 1 )/ 4 (V®« ® F 1)7l )(x) , 

where, in the last inequality, we used e' 7oW ^ /j - 1 < e alu and Condition (E-l) and thus C is 
some constant not depending on n. Inserting this in (5.34), we get, for m = nl + j with 
j G {0, ■ ■ ■ ,1} and if Z70 = a with I > L, 



E 



3 to z;r=i w ( 7 »=) 



(v^®Vi, 71 )(x„ 

< (73 c 3a«J 6 (n-l)/4 g Vl )71 )(x) 

< CV^o" 1 / 2 6 m /( 4 ') (V®« ® Fi, 7l )(x) 



(5.38) 



Now, this bound holds for any $\gamma_0 = a/l$ with $l \ge L$, where the constants $a$ and $b$ are
chosen so that (5.37) holds and $L$ only depends on $a$, $b$, $\gamma$ and $\gamma_1$. This yields (5.31) by
taking $\rho > 0$ such that $e^{\rho} < b^{-1/(4a)}$. Note that we have shown that (5.31) holds for all
$\gamma_0 \in (0, a/L]$ but only if $a/\gamma_0$ is an integer. Nevertheless, it is easy to show that this can
be extended to all $\gamma_0 \in (0, \gamma_0^*]$ by choosing $\rho$ and $\gamma_0^*$ adequately. □



If q = 1 in Proposition 5.5, then we only need to prove Condition (C) for the chain
$Q^{(-\mathcal{J})}$, which then corresponds to q = 0, that is $V_{0,\gamma_0}^{\otimes q} = 1$. Hence an immediate consequence
of Lemma 5.6 is that if q = 1 in Proposition 5.5, Condition (C) is implied by the
geometric ergodicity of the unconstrained chain Q (see Section 2.3) and a simple moment
condition on the stationary distribution $\pi$ of this chain, see (E-2).

However, for $q \ge 2$, Lemma 5.6 is no longer sufficient to prove Condition (C) of
Proposition 5.5 because of the presence of $V_{0,\gamma_0}^{\otimes q}$ in the denominator of (5.31). Again we
shall rely on an inductive reasoning to obtain the following result.

Lemma 5.7. Let $p, q \ge 1$. Let $\{(I_n, S_n, \lambda_n),\ n \ge 0\}$ be the Markov chain on the space
$\{0, 1, \dots, p\} \times \mathbb{Z}_+^q \times \mathbb{R}_+^p$ with transition kernel $Q$ defined in Section 2.3. For any
$\mathcal{J} \subset \{1, \dots, q\}$, further define the transition kernel $Q^{(-\mathcal{J})}$ on the space $\{0, \dots, p\} \times \mathbb{Z}_+^{q - \#\mathcal{J}} \times \mathbb{R}_+^p$,
as in Section 2.3. Let $w : \{0, 1, \dots, p\} \to \mathbb{R}$. Suppose that for any subset $\mathcal{J}$ in $\{1, \dots, q\}$,
the transition kernel $Q^{(-\mathcal{J})}$ satisfies Assumptions (E-1) and (E-2). Then for all $\gamma_1 > 0$
small enough, there are positive constants $\rho_0$ and $\gamma_0^* > 0$ such that, for all $\gamma_0 \in (0, \gamma_0^*]$,



sup sup 

(s,£)GX m>l 



e 7o ESLi {w(/ fc )+Po} y ( Am _ £) 



< 00 



(5.39) 



Proof. We prove this lemma by induction on q. 

For q = 0, Lemma 5.6 holds with $V_{0,\gamma_0}^{\otimes q} = 1$, which implies (5.39) for all $\gamma_1 > 0$ small
enough with some $\rho_0$, $\gamma_0^*$ only depending on $\gamma_1$. This provides the case q = 0 which
initiates the induction.

Next we show the result in the case $q \ge 1$ using the induction hypothesis that the
result holds for lower values of q.
Define for any $j = 1, \dots, q$, $A(j)$ as in (5.28) and

$s^*(j) = 1 + \max(A(j))$ and $s^* = \max_{j=1,\dots,q} s^*(j)$ .

If x = (s, i) g X is such that 

s6A = {l,.., S *(l)}x-x{l,.., S *(9)}, 

then, applying Lemma 5.6, for all 71 > small enough, we can choose 7,(0,7* > such 
that, for all 70 € (0,7*], there is some constant Cq > such that, for all m > 1, 



E x 



e 7oE^ 1 w(/ fc )y li7i(Am) 



< E x 



(5.40) 



< C c 9S * 7 e-^ om l/i, 7l (^ . 

We denote by r the first hitting time of the set A, 

t = inf {k > 0, S fc g 3} 

and we set r m = r A to, which is a stopping time with respect to the filtration Tk = 
er(Y n , < n < k), k > 0. We define by J m the set of indices j such that Sk(j) stay away 
of A(j) for all k = 0, 1, . . . , r m , which is a random set measurable with respect to T Tm and 
is non-empty by definition of r m . By definition of r m , we have, in the event {T m < to}, 
S Tm g 3 and thus, by (5.40), setting d = C e 9 ^ 7 , 



E 



e 7o Er_ m+1 w ( J *Vi )71 (A m ) 1 J < Ci c-^t™-^) y li71 (A Tm ) 



(5.41) 



On the other hand, up to n = T m the chain X„ (with transition kernel Q) coincides with 
the chain with transition kernel Q^ Jm ^ that starts at the same state, where J m is defined 
as above. Hence we may write, for all integer I = 1, 2, . . . , m, all non-empty subsets J of 
{!,... ,q} and x = (s, £) 6 X satisfying s ^ A, 



E x 



3 70 EEi w(/ fc ) e -p 70 ( m -. m ) Fi)7i (Arm) l {Tm=i}nUm=J} 

e 70 ELl w(4) c -P7o(m-0 y i)7i (A;) l {Tm=;}n{ Jm=J} 



r(-^)v 



< e -P7o(m-J) E (-J),x L70 EUi (A;) 



where {(/„, S„, A„), n > 0} here denotes the chain with transition kernel Q^~^ which 
starts at the same state as (Iq, So, Aq). Using the induction hypothesis, if 71 > is small 
enough, there are positive constants 7' and p' such that for all 70 g (0,7'], x g X and 
I > 1, 



E (-J),x 



c 7o EU w(/ fc) v (A;) < C2 c -p' 7o I v ( £) ; 



for some constant C2 > independent of Z and x. Inserting this bound in the previous 
display and summing over all / = 0,1,..., to and J, we get, for all 70 g (0,7'] and 
x = (s,£) g X such that s ^ A, 



e 7oE?" 1 w(/ fc ) e -p 7o (m-r m ) V lni (X Tm )] < C 3 ]T e^ 70 ^-'^' 70 l Vl^ (£) 



/=() 
where $C_3 > 0$ is independent of $m$ and $x$. This last bound, with (5.41), yields, for all
$\gamma_0 \in (0, \gamma' \wedge \gamma^*]$ and $x = (s, \ell) \in X$,



< E x 
<Ci 



,7oEr. Tm+1 w(/ t )y li7i(Am) |^ T 



;7o EI2i w(/ fc ) e -p(m-r,„) yr (X Tm ) 



< C 4 me-^') m yi, 71 (€) , 

where $C_4$ is some positive constant independent of $m$ and $x$. Now, taking $\gamma_0^* = \gamma' \wedge \gamma^*$ and
$\rho_0 \in (0, \rho \wedge \rho')$, we get (5.39). □

Theorem 5.8. Let $p, q \ge 1$ and suppose that Assumptions 1 and 2 hold. Let $\{(I_n, S_n, \lambda_n),\ n \ge 0\}$
be the Markov chain on the space $\{0, 1, \dots, p\} \times \mathbb{Z}_+^q \times \mathbb{R}_+^p$ with transition kernel $Q$ defined
in Section 2.3. For any $\mathcal{J} \subset \{1, \dots, q\}$, further define the transition kernel $Q^{(-\mathcal{J})}$
on the space $\{0, \dots, p\} \times \mathbb{Z}_+^{q - \#\mathcal{J}} \times \mathbb{R}_+^p$ as in Section 2.3. Define



w(i) 







/or all i = 1, . . . ,p 
for i = 0. 



(5.42) 



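Suppose that for any subset $\mathcal{J}$ in $\{1, \dots, q\}$, the transition kernel $Q^{(-\mathcal{J})}$ satisfies

(B-1) For all $\gamma' > 0$ small enough, there exists $\gamma^* > 0$ such that for all $\gamma \in (0, \gamma^*]$, $Q^{(-\mathcal{J})}$
is $(\mathbb{1}_{\{0,1,\dots,p\}} \otimes V_{0,\gamma}^{\otimes (q - \#\mathcal{J})} \otimes V_{1,\gamma'})$-geometrically ergodic.

(B-2) The stationary distribution $\pi^{(-\mathcal{J})}$ of $Q^{(-\mathcal{J})}$ satisfies $\pi^{(-\mathcal{J})}(w \otimes \mathbb{1}_{\mathbb{Z}_+^{q - \#\mathcal{J}} \times \mathbb{R}_+^p}) < 0$.

Then, for all $\gamma_1 > 0$ small enough, there exists $\gamma_0^* > 0$ such that for all $\gamma_0 \in (0, \gamma_0^*]$, the
transition kernel $Q$ is $(\mathbb{1}_{\{0,1,\dots,p\}} \otimes V_{0,\gamma_0}^{\otimes q} \otimes V_{1,\gamma_1})$-geometrically ergodic.

In practice, Theorem 5.8 should be applied by induction, so that only conditions of
the form (B-2) have to be checked. The case q = 0 is treated in Proposition 3.3.
We treat the case q = 1 in the following section. Then we explain in Section 3.5 how to
perform this induction for $q \ge 2$.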

Proof of Theorem 5.8. By Conditions (B-l) and (B-2), we may apply Lemma 5.7 to show 
that Condition (C) of Proposition 5.5 holds. Hence, applying this proposition, we get the 
drift condition (5.25). 

The small sets of Theorem 3.1 are of the form $\{1, \dots, K\}^q \times (0, M]^p$. Observe that
the sublevel sets of $V_{0,\gamma_0}^{\otimes q} \otimes V_{1,\gamma_1}$



a 



7o,7l 



(r) = {x e Z 4 



^7o®W x )<r}= r>0 



are included in such sets. Hence the drift condition (5.25) applies to show that the
chain $Q$ is $(V_{0,\gamma_0}^{\otimes q} \otimes V_{1,\gamma_1})$-geometrically ergodic, see [33, Chapter 15]. The fact that $Q$ is
$(\mathbb{1}_{\{0,\dots,p\}} \otimes V_{0,\gamma_0}^{\otimes q} \otimes V_{1,\gamma_1})$-geometrically ergodic follows as in the proof of Proposition 3.3. □

Proof of Theorem 3.7. This result is obtained by induction on q using Theorem 5.8 and 
Theorem 3.5 to initiate the induction at q = 1. □ 



5.6 Proof of Corollary 3.8 

We first show that for all $D > 0$, $K \ge 1$ and $M > 0$, the set $C = (0, D] \times \{0,\dots,p\} \times
\{1,\dots,K\}^q \times (0, M]^p$ is a petite set. By Corollary 5.4, there exist $m \ge 1$, $\epsilon > 0$ and a
probability measure $\nu$ on $\mathsf{Y}$ such that $\{0,\dots,p\} \times \{1,\dots,K\}^q \times (0, M]^p$ is an $(m, \epsilon, \nu)$-
and $(m+1, \epsilon, \nu)$-small set for $Q$. For all $z = (\delta, i, s, \ell) \in C$, under the initial condition
$Z_0 = z$, we have $\lambda_{m-1} \in (0, M + (m - \ldots$ It follows that the density of $\Delta_m$ is

bounded from below by $c_0 e^{-c_1 t}$ over $t \in \mathbb{R}_+$ for some positive constants $c_0, c_1$ (see
the proof of Lemma 5.1). We get that for all Borel sets $A \subset \mathbb{R}_+$ and $B \subset \mathsf{Y}$,

$$\overline{Q}^m(z, A \times B) \ge c_0 \int_A e^{-c_1 t}\,dt \; Q^m(y, B) \ge \epsilon\,(c_0/c_1)\,\bar\nu(A \times B) ,$$

where we used that $\{0,\dots,p\} \times \{1,\dots,K\}^q \times (0, M]^p$ is an $(m, \epsilon, \nu)$-small set for $Q$ and $\bar\nu$ is the
probability measure defined by $\bar\nu(A \times B) = \int_A c_1 e^{-c_1 t}\,dt \; \nu(B)$. Hence $C$ is an $(m, \epsilon, \bar\nu)$-small
set for $\overline{Q}$. Similarly, we have that $C$ is also an $(m+1, \epsilon, \bar\nu)$-small set. Therefore,
we conclude that $\overline{Q}$ is $\psi$-irreducible and aperiodic and that all bounded subsets of $\mathsf{Z}$ are
petite sets.

Next we show that the drift condition obtained in Proposition 5.5 for $Q$ extends to $\overline{Q}$.
As explained in Section 3.5, the assumptions of Theorem 3.7 imply those of Theorem 5.8
and thus of Proposition 5.5 (see the proof of Theorem 3.7). Hence, for all $\gamma_1 > 0$ small
enough, there exist $\gamma_0^* > 0$, $\theta \in (0, 1)$, $b > 0$ and $m \ge 1$ such that for all $\gamma_0 \in (0,\gamma_0^*]$
and any initial condition $x \in \mathsf{X}$, we have

$$Q^m(V_{0,\gamma_0} \otimes V_{1,\gamma_1})(x) \le \theta\,(V_{0,\gamma_0} \otimes V_{1,\gamma_1})(x) + b .$$



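By the Cauchy-Schwarz inequality and using that for all $x \in \mathsf{X}$, $\Delta_m$ is stochastically
smaller than an exponential distribution with mean $(\mathbf{1}^T\mu)^{-1}$ (see the proof of
Lemma 5.1), we have

$$\begin{aligned}
\overline{Q}^m(V_{2,\gamma_2} \otimes 1_{\{0,\dots,p\}} \otimes V_{0,\gamma_0/2} \otimes V_{1,\gamma_1/2})(\delta, i, x)
&\le \left(E_x\left[V_{2,\gamma_2}^2(\Delta_m)\right]\right)^{1/2} \left(Q^m(1_{\{0,\dots,p\}} \otimes V_{0,\gamma_0} \otimes V_{1,\gamma_1})(i, x)\right)^{1/2} \\
&\le \left(\mathbf{1}^T\mu \int_0^\infty e^{2\gamma_2 r} e^{-\mathbf{1}^T\mu\, r}\,dr\right)^{1/2} \left(\theta\,(V_{0,\gamma_0} \otimes V_{1,\gamma_1})(x) + b\right)^{1/2} .
\end{aligned}$$

Provided that $\gamma_2 < \mathbf{1}^T\mu/2$, we thus get

$$\overline{Q}^m(V_{2,\gamma_2} \otimes 1_{\{0,\dots,p\}} \otimes V_{0,\gamma_0/2} \otimes V_{1,\gamma_1/2})(\delta, i, x)
\le \left(1 - 2\gamma_2/\mathbf{1}^T\mu\right)^{-1/2} \left(\theta^{1/2}\,(V_{0,\gamma_0/2} \otimes V_{1,\gamma_1/2})(x) + b^{1/2}\right) .$$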
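The prefactor in the last bound can be traced to the Laplace transform of an exponential variable. The derivation below is only a sketch under two assumptions that this section does not restate: that $V_{2,\gamma_2}(\delta) = e^{\gamma_2 \delta}$, and that $c$ denotes the rate of an exponential variable $E$ dominating $\Delta_m$ (the symbols $c$, $E$ and $v$ are introduced here for illustration).

```latex
\begin{align*}
\mathbb{E}\bigl[e^{2\gamma_2 E}\bigr]
  &= \int_0^\infty c\, e^{2\gamma_2 r} e^{-c r}\,dr
   = \frac{c}{c - 2\gamma_2}
   = \Bigl(1 - \frac{2\gamma_2}{c}\Bigr)^{-1},
   \qquad 2\gamma_2 < c, \\
\sqrt{\theta v + b}
  &\le \sqrt{\theta}\,\sqrt{v} + \sqrt{b},
   \qquad v \ge 0,\ b \ge 0 .
\end{align*}
```

Taking the square root of the first line gives the factor in front of the bound, and the second line is the elementary inequality that separates the drift term from the additive constant.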



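Hence, for $\gamma_2 > 0$ small enough, we have, for some $\theta' \in (0, 1)$ and $b' > 0$,

$$\overline{Q}^m(V_{2,\gamma_2} \otimes 1_{\{0,\dots,p\}} \otimes V_{0,\gamma_0/2} \otimes V_{1,\gamma_1/2})(\delta, i, x) \le \theta'\,(V_{0,\gamma_0/2} \otimes V_{1,\gamma_1/2})(x) + b' .$$

Hence we obtain a drift condition with the function $V_{2,\gamma_2} \otimes
1_{\{0,\dots,p\}} \otimes V_{0,\gamma_0/2} \otimes V_{1,\gamma_1/2}$, which is unbounded off petite sets. The geometric ergodicity follows (see [33, Chapter 15]).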



5.7 Proof of Theorem 3.9

We need the following lemmas to complete the proof of this theorem. Denote the number
of arrivals on $[0, u]$ by $N_u = N(1_{(0,u]} \otimes 1_{\{0,\dots,p\}})$ and the time elapsed between the last
arrival before $u$ and $u$ by $\tau_u = u - \sum_{k=1}^{N_u} \Delta_k$.
Lemma 5.9. We have

$$\lim_{u \to \infty} \frac{N_u}{u} = \frac{1}{E(\Delta_1)} , \quad \text{a.s. } [P_*] .$$

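Proof. By definition of $N_u$, we have

$$\sum_{k=1}^{N_u} \Delta_k \le u < \sum_{k=1}^{N_u+1} \Delta_k .$$

Since $Q$ is positive Harris, using [33, Theorem 17.3.2], we get

$$\lim_{n \to \infty} \frac{1}{n} \sum_{k=1}^{n} \Delta_k = E(\Delta_1) , \quad \text{a.s. } [P_*] .$$

We deduce that

$$\lim_{u \to \infty} \frac{1}{N_u} \sum_{k=1}^{N_u} \Delta_k = \lim_{u \to \infty} \frac{1}{N_u} \sum_{k=1}^{N_u+1} \Delta_k = E(\Delta_1) , \quad \text{a.s. } [P_*] .$$

Hence the result. □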
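As a numerical illustration of the ratio limit in Lemma 5.9, the sketch below simulates a simplified stand-in for the process: a univariate Hawkes process with exponential fertility function and no constraint variable. The function name and the parameters `mu`, `alpha`, `beta` are hypothetical choices made for this example, not notation from the paper. For this simplified model the stationary arrival rate is the standard value $\mu/(1 - \alpha/\beta)$, which plays the role of $1/E(\Delta_1)$.

```python
import math
import random


def simulate_hawkes_count(mu, alpha, beta, horizon, seed=0):
    """Count events on [0, horizon] of a univariate exponential-kernel Hawkes process.

    Assumed intensity (illustrative model, no constraint variable):
        lambda(t) = mu + sum_{t_i < t} alpha * exp(-beta * (t - t_i)).
    Ogata's thinning is used: between events the intensity only decays,
    so its current value is a valid upper bound for the next proposal.
    """
    rng = random.Random(seed)
    t = 0.0
    excess = 0.0  # lambda(t) - mu, the self-excited part of the intensity
    count = 0
    while True:
        bound = mu + excess
        wait = rng.expovariate(bound)
        t += wait
        if t > horizon:
            return count
        excess *= math.exp(-beta * wait)  # decay up to the proposal time
        if rng.random() * bound <= mu + excess:  # accept with prob lambda(t)/bound
            count += 1
            excess += alpha  # each accepted event raises the intensity by alpha


mu, alpha, beta, horizon = 1.0, 0.5, 1.0, 5000.0
n_events = simulate_hawkes_count(mu, alpha, beta, horizon)
print("empirical N_u / u :", n_events / horizon)
print("mu / (1 - alpha/beta):", mu / (1.0 - alpha / beta))
```

The two printed values should be close for a long horizon; the agreement is statistical, so it improves as `horizon` grows.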
Lemma 5.10. For all $T \in \mathbb{R}_+$, we have

$$\sup_{t \in [0,1]} \tau_{tT} = o_{P_*}(\sqrt{T}) , \quad \text{a.s. } [P_*] .$$



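Proof. Let us define

$$n_T = \left\lfloor \frac{T}{E(\Delta_1)} \right\rfloor . \qquad (5.43)$$

By definition of $\tau_u$, for all $t \in [0, 1]$, we have that $\tau_{tT} \le \max_{k=1,\dots,N_T+1} \Delta_k$.
For any $t \in [0, 1]$, $\epsilon > 0$ and $T \in \mathbb{R}_+$, we get, for all $x \in \mathsf{X}$,

$$P_x\left(\tau_{tT} > \epsilon\sqrt{T}\right) \le P_x\left(\max_{k=1,\dots,2n_T} \Delta_k > \epsilon\sqrt{T}\right) + P_x\left(N_T + 1 > 2n_T\right) .$$

Using Lemma 5.9, we have $\lim_{T \to \infty} P_x(N_T + 1 > 2n_T) = 0$. Observe that we have, for all
$k \ge 1$, $\Delta_k \le E_k$ where $\{E_i\}_{i \ge 1}$ are the inter-arrivals of a PPP with intensity $\mathbf{1}^T\mu$. We
get, for all $x \in \mathsf{X}$,

$$\lim_{T \to \infty} P_x\left(\max_{k=1,\dots,2n_T} \Delta_k > \epsilon\sqrt{T}\right) \le \lim_{T \to \infty} P_x\left(\max_{k=1,\dots,2n_T} E_k > \epsilon\sqrt{T}\right) = 0 ,$$

which concludes the proof. □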
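The last limit can be made concrete with a small computation. For i.i.d. exponential variables with a common rate, the expected maximum of $n$ of them is $H_n/\text{rate}$, where $H_n$ is the $n$-th harmonic number, so it grows only logarithmically in $n$ while the threshold grows like $\sqrt{T}$. The sketch below assumes, for illustration only, that $n_T$ is taken as the integer part of $T/E(\Delta_1)$; the names `rate` and `mean_gap` are hypothetical stand-ins for the Poisson intensity and for $E(\Delta_1)$.

```python
import math


def expected_max_exponential(n, rate):
    """E[max(E_1, ..., E_n)] for i.i.d. Exp(rate) variables, equal to H_n / rate."""
    return sum(1.0 / k for k in range(1, n + 1)) / rate


def scaled_expected_max(T, rate, mean_gap):
    """Expected maximum of 2 * n_T exponential gaps, divided by sqrt(T).

    n_T is taken here as floor(T / mean_gap), an assumption of this sketch.
    """
    n = 2 * math.floor(T / mean_gap)
    return expected_max_exponential(n, rate) / math.sqrt(T)


for T in (1e2, 1e3, 1e4, 1e5):
    print(T, scaled_expected_max(T, rate=1.0, mean_gap=0.5))
```

By Markov's inequality, the probability that the maximum exceeds $\epsilon\sqrt{T}$ is at most the printed quantity divided by $\epsilon$, which is one elementary way to see why it vanishes as $T$ grows.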

We now provide a proof of Theorem 3.9 using Lemmas 5.9 and 5.10. 
Proof of Theorem 3.9. For all $t \in [0, 1]$, $T \in \mathbb{R}_+$ and $x \in \mathsf{X}$, we have

$$N(1_{[0,tT]} \otimes w) - tT\,E(w) = \sum_{k=1}^{N_{tT}} \left(w(I_k) - E(w)\Delta_k\right) - E(w)\,\tau_{tT} .$$

We also observe that, defining $n_T$ as in (5.43),

$$\sum_{k=1}^{N_{tT}} \left(w(I_k) - E(w)\Delta_k\right) = s_{n_T}\left(\frac{N_{tT}}{n_T}, g\right) ,$$

where we used that, by definition of $E(w)$, $\pi(g) = 0$ and that $N_{tT}$ is an integer. Hence
by Lemma 5.10, we get

$$N(1_{[0,tT]} \otimes w) - tT\,E(w) = s_{n_T}\left(\frac{N_{tT}}{n_T}, g\right) + o_{P_*}(\sqrt{T}) . \qquad (5.44)$$

Now, by Lemma 5.9, we have, for all $t \in [0, 1]$, $N_{tT}/n_T \to t$ as $T \to \infty$ a.s. $[P_*]$. By the Dini
theorem, this convergence holds uniformly over $t \in [0, 1]$. Applying [40, Theorem 13.2.1],
this convergence and the one in (3.15) imply that, for any initial distribution,

This, (5.44), $\sigma_g^2 = v(w)$ and the fact that $n_T/T \to 1/E[\Delta_1]$ yield (3.18), which completes
the proof. □
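The first display of this proof is a purely algebraic identity: it follows from writing the elapsed time as the sum of the inter-arrival times up to the last event plus the remaining time since that event. The sketch below checks it on synthetic data. The function name, the dictionary `w` and the constant `e_w` (standing in for $E(w)$) are hypothetical, and the identity holds for any value of `e_w`.

```python
import random


def check_decomposition(gaps, marks, w, e_w, u):
    """Return both sides of
        N(1_[0,u] (x) w) - u * E(w) = sum_k (w(I_k) - E(w) * Delta_k) - E(w) * tau_u,
    where the sum runs over the events that occurred up to time u.
    """
    arrival, n_u = 0.0, 0
    for gap in gaps:
        if arrival + gap > u:
            break
        arrival += gap
        n_u += 1
    tau_u = u - arrival  # time elapsed since the last arrival before u
    lhs = sum(w[marks[k]] for k in range(n_u)) - u * e_w
    rhs = sum(w[marks[k]] - e_w * gaps[k] for k in range(n_u)) - e_w * tau_u
    return lhs, rhs


rng = random.Random(1)
gaps = [rng.expovariate(2.0) for _ in range(500)]   # synthetic inter-arrival times
marks = [rng.randrange(3) for _ in range(500)]      # synthetic event types
weights = {0: 0.0, 1: 1.0, 2: -1.0}                 # illustrative choice of w
print(check_decomposition(gaps, marks, weights, e_w=0.3, u=50.0))
```

Both returned values agree up to floating-point rounding, which isolates the only probabilistic content of the proof in the behaviour of the partial sums and of the remainder term.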



6 Conclusion 

In this paper we have introduced and studied constrained multivariate Hawkes processes.
The constraints are expressed using a multidimensional constraint variable, whose evolution
is driven by the point process. In the Markov setting (exponential fertility
functions), we have proven that the underlying Markov chain is V-geometrically ergodic
under some conditions on the parameters. A converse result leading to the transience of
the chain in the case where a univariate constraint variable is used (q = 1) illustrates the
sharpness of our conditions. Moreover, in the general case, we used a functional central
limit theorem applied to the chain to derive the scaling limit of the integrated point process
in physical time. Finally we have briefly explained how the constrained multivariate
Hawkes process can be applied to model the dynamics of a limit order book. The scaling
limit of the mid-price can be deduced from our findings.

We only presented the case of the order book of one asset with only two limits. This
is not a restriction: multi-asset limit order books can be considered, yielding
multivariate boundary conditions (hence $q \ge 2$). In a forthcoming paper, we will use this
model to provide an empirical study based on real data and discuss the potential application
of this model to the dynamics of limit order books. We believe that the applicability of the
constrained multivariate Hawkes process extends well beyond financial applications, as it
could model the motion of an object on a discrete net with some boundary conditions.
A Additional technical lemmas

The following result is used in the proof of Proposition 5.3. 

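Lemma A.1. Let $p \ge 1$ and let $\Gamma$ be a $p \times p$ invertible matrix. Then there exists a probability
measure $\nu$ such that, for all $M > 0$, $\gamma > 0$, there exists $\epsilon > 0$ such that for all $g : \mathbb{R}^p \to \mathbb{R}_+$
and $\ell \in (0, M]^p$,

$$\int_D g(u\ell + \Gamma\vartheta)\, u^\gamma \,du\,d\vartheta \ge \epsilon \int g \,d\nu ,$$

where $D = \{(u, \vartheta) \in (0, 1] \times (0, 1]^p : 0 < u < \vartheta(1) < \dots < \vartheta(p)\}$.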
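Proof. Setting $\omega = u\ell + \Gamma\vartheta$, we have

$$\int_D g(u\ell + \Gamma\vartheta)\, u^\gamma \,du\,d\vartheta = \frac{1}{|\det\Gamma|} \int g(\omega)\, u^\gamma\, 1_D\left((u, \Gamma^{-1}\omega - u\Gamma^{-1}\ell)\right) du\,d\omega .$$

We choose any $0 < \eta < 1/(p+1)$ so that the open set $B_\eta = \{\vartheta \in (0, 1]^p : \eta <
\vartheta(1),\ \eta + \vartheta(1) < \vartheta(2),\ \dots,\ \eta + \vartheta(p-1) < \vartheta(p),\ \vartheta(p) < 1 - \eta\}$ is not empty.
Denoting $\Gamma B_\eta = \{\Gamma\vartheta : \vartheta \in B_\eta\}$ and setting

$$\eta' = \frac{1}{2}\,\eta\,\min\left(1, \left(\max_{1 \le i \le p} \sup_{\ell \in (0,M]^p} \left|[\Gamma^{-1}\ell]_i\right|\right)^{-1}\right) ,$$

we have that for all $u \in (0, \eta']$, $\ell \in (0, M]^p$ and $\omega \in \Gamma B_\eta$, $(u, \Gamma^{-1}\omega - u\Gamma^{-1}\ell) \in D$. We
obtain that

$$\int_D g(u\ell + \Gamma\vartheta)\, u^\gamma \,du\,d\vartheta \ge \frac{1}{|\det\Gamma|} \int_{\Gamma B_\eta} g(\omega) \left(\int_0^{\eta'} u^\gamma \,du\right) d\omega = \frac{(\eta')^{\gamma+1}\, \mu_{\mathrm{Leb}}(\Gamma B_\eta)}{(\gamma + 1)\,|\det\Gamma|} \int g \,d\nu ,$$

where $\mu_{\mathrm{Leb}}$ is Lebesgue measure and $\nu$ is the uniform probability measure on $\Gamma B_\eta$. □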

The following result is used in the proof of Proposition 3.2. 
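Lemma A.2. Let $\beta > 0$ and $\beta' > 0$. Then, as $a \to \infty$,

$$\int_0^\infty e^{a(e^{-\beta t} - 1) - \beta' t}\,dt \to 0 . \qquad (A.1)$$

If moreover $\beta' > \beta$, then we have the following asymptotic equivalence as $a \to \infty$,

$$\int_0^\infty e^{a(e^{-\beta t} - 1) - \beta' t}\,dt \sim \frac{1}{a\beta} . \qquad (A.2)$$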

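Proof. Setting $\vartheta = e^{-\beta t}$, we get

$$\int_0^\infty e^{a(e^{-\beta t} - 1)} \left(e^{-\beta t}\right)^{\beta'/\beta} dt = \frac{1}{\beta} \int_0^1 e^{a(\vartheta - 1)}\, \vartheta^{\beta'/\beta - 1}\,d\vartheta .$$

Letting $a \to \infty$, by dominated convergence, we get (A.1).
Now we set $\omega = -a(\vartheta - 1)$ and obtain that

$$\frac{1}{\beta} \int_0^1 e^{a(\vartheta - 1)}\, \vartheta^{\beta'/\beta - 1}\,d\vartheta = \frac{1}{a\beta} \int_0^a e^{-\omega} \left(1 - \frac{\omega}{a}\right)^{\beta'/\beta - 1} d\omega .$$

Letting $a \to \infty$, we obtain (A.2) by dominated convergence. □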
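The asymptotic equivalence can be checked numerically. The sketch below evaluates the integral after the change of variable to the unit interval, using a plain midpoint rule, and multiplies it by $a\beta$. The function names and the step count are choices made for this illustration. For $\beta = 1$ and $\beta' = 2$ the integral has the closed form $1/a - (1 - e^{-a})/a^2$, so the scaled value should approach 1 from below.

```python
import math


def integral_on_unit_interval(a, beta, beta_prime, steps=200_000):
    """Midpoint-rule value of (1/beta) * int_0^1 exp(a*(th - 1)) * th**(beta'/beta - 1) d th."""
    h = 1.0 / steps
    power = beta_prime / beta - 1.0
    total = 0.0
    for k in range(steps):
        theta = (k + 0.5) * h
        total += math.exp(a * (theta - 1.0)) * theta ** power
    return total * h / beta


def scaled_integral(a, beta, beta_prime):
    """a * beta * integral, which should tend to 1 as a grows when beta' > beta."""
    return a * beta * integral_on_unit_interval(a, beta, beta_prime)


for a in (10.0, 100.0, 1000.0):
    print(a, scaled_integral(a, beta=1.0, beta_prime=2.0))
```

The midpoint rule is adequate here because the integrand is smooth on the unit interval when $\beta' > \beta$; for $\beta' < \beta$ the integrand is singular at the origin and a different quadrature would be needed.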